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PROTEIN INVOLVED IN CANCER 

The present invention relates to the use of a protein (NKCC1) in the diagnosis, screening, 
treatment and prophylaxis of breast, lung and/or pancreatic cancer, compositions comprising the protein, 
5 including vaccines, and antibodies that are immunospecific for the protein. 

The major challenges in breast, lung and/or pancreatic cancer treatment is to improve early 
detection rates, to find new non-invasive markers that can be used to follow disease progression and 
identify relapse, and to find improved and less toxic therapies, especially for more advanced disease 
where 5 year survival is still very poor. There is a great need to identify targets which are more specific 
10 to the cancer cells, ideally ones which are expressed on the surface of the tumour cells so that they can be 
attacked by promising new approaches like immunotherapeutics and targeted toxins. 

An ideal protein target for cancer immunotherapy should have a restricted expression profile in 
normal tissues and be over-expressed in tumours, such that the immune response will be targeted to 
tumour cells and not against other organs. In addition, the protein target should be exposed on the cell 
15 surface, where it will be accessible to therapeutic Tumour specific proteins have been identified for a 

number of cancer types, by using techniques such as differential screening of cDNA (Hubert, R.S., et al., 
1999, Proc. Natl Acad, Sci. USA 96, 14523-14528; Lucas, S., et ah Int. J. Cancer 87, 55-60 (2000)), and 
the purification of cell-surface proteins that are recognised by tumour-specific antibodies (Catimel, B., et 
al J. Biol. Chem. 271, 25664-25670 (1996)). 
20 Recently, DNA 'chips' containing up to 10,000 expressed sequence elements have been used to 

characterise tumour cell gene expression Dhanasekaran, S.M., et al., 2001, Nature 412, 822-826). 
However, there are several reasons why the numerous and extensive previous transcriptomic analysis of 
breast cancer may not have revealed all, or even most, tumour associated proteins. These include; (i) a 
lack of correlation between transcript and disease-associated protein levels, particularly common for 
25 membrane proteins that often have a long half-life and as such do not have a high mRNA turnover. 

Therefore, whilst the difference in protein levels between normal and cancerous cells are consistent it is 
often difficult to associate changes in the mRNA for a given membrane protein with the cancerous state, 
(ii) Translocation of a protein in the disease state rather than simply differential levels of the transcript, 
for example, erbB2/HER2-neu, shows much greater plasma-membrane localisation in cancer cells than 
30 normal breast cells, and the transcription factors oestrogen receptor and STAT3 translocate to the nucleus 
to exert their tumourigenic effects, (iii) Novel/uncharacterised genes are not highly represented within 
the 'closed system' of a cDNA array where there are restrictions on the number of expressed sequence 
elements per chip and the knowledge and availability of DNA clones. 

NKCC1 (the sequence of which can be found under the accession number P5501 1 in the 
35 SwissProt database, (available at http://www.expasy.org/), and which is shown in Figure 1, SEQ ID 

NO:l) is a bumetanide sensitive sodium-potassium-chloride (Na-K-Cl) cotransporter (Payne et al, (1995), 
J Biol Chem, 270(30): ppl7977-17985). Two isoforms of the Na-K-Cl cotransporter have been identified; 
one located on the apical membrane of absorptive epithelia (NKCC2) and one located on the basolateral 
membrane of secretory epithelia (NKCC1). The function of the Na-K-Cl cotransporters is to provide 
40 electroneutral transport of chloride ions across epithelia (in a ratio of INa: 1K:2C1), they work in 
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combination with the sodium and potassium channels and the sodium pump to cause a net transport of 
sodium chloride across membranes. Dysregulation of this transport mechanism can result in diseases 
such as Cystic Fibrosis and secretory diarrhoea. 

The human NKCC1 protein was identified by the Payne et ql in 1995 (see above) by the 
5 screening of a human colon carcinoma cell line (T84). It shows significant similarity to the NKCC1 
proteins identified from elasmobranch, mouse, rat, rabbit and flounder. 

Both isoforms of the Na-K-Cl cotransporter show a similar structure with large N and C terminal 
domains and 12 transmembrane segments. NKCC1 is significantly larger than NKCC2 with an extra 80 
amino acids at the N terminus, interestingly it is the N terminus that shows the greatest variation between 
10 species indicating that this section may not be directly involved in the ion transportation. A recent study 
has indicated that NKCC1 may have 2 splice variants which show differential expression across a variety 
of tissue types indicating that differential splicing may play a regulatory role in NKCC1 activity (Vibat et 
al Anal Biochem 2001;298(2):218-30). 

Overexpression of NKCC1 has been reported in asthmatic subjects by gene expression profiling 
15 (Dolganov GM et ah Genome Res (2001); 1 1(9): 1473-83). This data was then confirmed using 

immunohistochemistry, which demonstrated that in asthmatic subjects NKCC1 expression is increased 
with restricted localisation to goblet cells implicating NKCC1 in the pathophysiology of mucus 
hypersecretion in asthma. 

However, the prior art does not show a cancer-associated alteration of the NKCC1 protein and 
20 therefore, does not show the usefulness of NKCC1 in a therapeutic treatment approach to breast, lung 
and/or pancreatic cancer cells. 

The present invention is based on the finding that the expression of NKCC1 protein is increased 
in sample of tissue from breast, lung and/or pancreatic cancer and NKCC1 mRNA shows restricted 
expression in a few tissues with elevated expression in breast, lung and/or pancreatic cancer samples, 
25 suggesting its utility as a target for breast, lung and/or pancreatic cancer therapy and diagnosis. 

Thus, one aspect of the present invention provides a method of screening for and/or diagnosis of 
breast, lung and/or pancreatic cancer in a subject and/or monitoring the effectiveness of breast, lung 
and/or pancreatic cancer therapy, which method comprises the step of detecting and/or quantifying in a 
biological sample obtained from said subject, an NKCC1 polypeptide which: 
30 a) comprises or consists of the amino acid sequence shown in Figure 1 (SEQ ID NO: 1); 

b) is a derivative having one or more amino acid substitutions, modifications, deletions or 
insertions relative to the amino acid sequence shown in Figure 1 (SEQ ID NO: 1) which 
retains the biological activity of NKCC1; or 

c) is a fragment of a polypeptide having the sequence shown in Figure 1 (SEQ ID NO: 1), 
35 which is at least ten amino acids long and has at least 70% sequence identity over the 

length of the fragment; 
The polypeptides described in a) to c) above are hereinafter referred to as 'TSfKCCl 
polypeptides". The term "polypeptides" includes peptides, polypeptides and proteins. These are used 
interchangeably unless otherwise specified. 
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In another aspect, the invention provides a method of screening for and/or diagnosis of breast, lung 
and/or pancreatic cancer in a subject, which method comprises the step of detecting and/or quantifying in a 
biological sample obtained from said subject, an NKCC1 nucleic acid which: 

d) comprises or consists of the DNA sequence shown in Figure 1 (SEQ ID NO:2) or its 
RN A equivalent; 

e) a sequence which codes for a derivative or fragment of an amino acid molecule as defined in 
a) orb) (SEQ ID NO: 1). 

f) a sequence which is complementary to die sequences of d) or e); 

g) a sequence which codes for the same polypeptide, as the sequences of d) or e); or 
10 h) a sequence which shows substantial identity with any of those of d), e), f) or g). 

i) is a fragment of d), e), f), g) or h), which is at least 8 nucleotides in length. 
The nucleic acid molecules described in d) to i) above are hereinafter referred to as "NKCC1 
nucleic £lcicLs*' 

In another aspect, the present invention provides a method for the prophylaxis and/or treatment of 
15 breast, lung and/or pancreatic cancer in a subject, which comprises administering to said subject a 

therapeutically effective amount of at least one NKCC1 polypeptide or at least one NKCC1 nucleic acid. 

In a yet another aspect, the present invention provides the use of at least one NKCC1 polypeptide 
or at least one NKCC1 nucleic acid in the preparation of a composition for use in the prophylaxis and/or 
treatment of breast, lung and/or pancreatic cancer. The subject may be a mammal and is preferably a 

20 human. _ 

Li the aspects supra, the NKCC1 polypeptides may be provided in isolated or recombinant form, and 
may be fused to other moieties. The NKCC1 polypeptides thereof may be provided in substantially pure 
form, that is to say free, to a substantial extent, from other proteins. Thus, a polypeptide for use in the present 
invention may be provided in a composition in which it is the predominant component present (i.e. it is 
25 present at a level of at least 50%; preferably at least 75%, at least 80%, at least 85%, at least 90%, at least 
95% or at least 98%; when determined on a weight/weight basis excluding solvents or carriers). 

In order to more fully appreciate the present invention, polypeptides within the scope of a)-c) above 
will now be discussed in greater detail. 

30 Polypeptides within the scope of a) 

A polypeptide within the scope of a), may consist of the particular amino acid sequence given in 
Figure 1 or may have an additional N-terminal and/or an additional C-terminal amino acid sequence relative 

to the sequence given in Figure 1 . 

Additional N-terminal or C-terminal sequences may be provided for various reasons. Techniques for 
35 providing such additional sequences are well known in the art. 

Additional sequences may be provided in order to alter the characteristics of a particular polypeptide. 
This can be useful in improving expression or regulation of expression in particular expression systems. For 
example, an additional sequence may provide some protection against proteolytic cleavage. 

Additional sequences can also be useful in altering the properties of a polypeptide to aid in 
40 identification or purification. For example, a fusion protein may be provided in which a polypeptide is linked 
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to a moiety capable of being isolated by affinity chromatography. The moiety may be an antigen or an 
epitope and the affinity column may comprise immobilised antibodies or immobilised antibody fragments 
which bind to said antigen or epitope (desirably with a high degree of specificity). The fusion protem can 
usually be eluted from the column by addition of an appropriate buffer.. 

5 Additional N-terminal or C-terminal sequences may, however, be present simply as a result of a 

particular technique used to obtain a polypeptide and need not provide any particular advantageous 
characteristic to the polypeptide. Such polypeptides are within the scope of the present invention. 

Whatever additional N-terminal or C-terminal sequence is present, it is preferred that the resultant 
polypeptide should exhibit the immunological or biological activity of the polypeptide having the ammo acid 

10 sequence shown in Figure 1 (SEQ ID NO: 1). 

Pnlvnentides within the scope of b) . 

Turning now to the polypeptides defined in b) above, it will be appreciated by the person skilled in 
the art that these polypeptides are derivatives of the polypeptide given in a) above, provided that such 
15 derivatives preferably exhibit the immunological or biological activity of the polypeptide having the amino 
acid sequence shown in Figure 1. Alternatively, the biological activity ofthe polypeptide may be altered. As 
such, it will be appreciated by one skilled in the art that derivatives can include post-translational 
modifications, for example but without limitation, phosphorylation, glycosylation and farnesylation. 

As used herein, the term "derivative" refers to a polypeptide that comprises an ammo acid 
20 sequence of a parent polypeptide that has been altered by the introduction of amino acid residue 
substitutions, deletions or additions, and/or amino acid modifications such as but not limited to, 
phosphorylation and glycosylation. The derivative polypeptide retains the activity of the parent 

polypeptide. . . - 

Alterations in the amino acid sequence of a protein can occur which do not affect the activity of a 
25 protein. These include amino acid deletions, insertions and substitutions and can result from alternative 
splicing and/or the presence of multiple translation start sites and stop sites. Polymorphisms may arise as 
a result of the infidelity of the translation process. Thus changes in amino acid sequence may be tolerated 
which do not affect the protein's biological or immunological activity. 

In various aspects ofthe present invention, for example therapy and/or prophylaxis of breast, lung 
30 and/or pancreatic cancer the use of dominant negative or constitutively active NKCC1 polypeptides is 
contemplated. Therefore, in one embodiment, the deleted, inserted or substituted amino acid(s) renders 
dominant negative activity upon the peptide. In another embodiment, the deleted, inserted or substituted 
amino acid(s) renders the polypeptide constitutively active. 

The skilled person will appreciate that various changes can often be made to the amino acid sequence 
35 of a polypeptide which has a particular activity to produce derivatives (sometimes known as variants or 

"muteins") having at least a proportion of said activity, and preferably having a substantial proportion of said 
activity. Such derivatives of the polypeptides described in a) above are within the scope of the present 
invention and are discussed in greater detail below. They include allelic and non-allelic derivatives. 

An example of a derivative of an NKCCI polypeptide is a polypeptide as defined in a) above, apart 
40 frommesubstimtionofoneormoreammoacidswimoneormoreomeraminoacids. The skilled person is 
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aware that various amino acids have similar properties. One or more such amino acids of a polypeptide can 
often be substituted by one or more other such amino acids without eliminating a desired activity of that 

polypeptide^ ^ ^ glycme> ^mme, valine, leucine and isoleucine can often be substituted for one 
5 another (amnio acids having aliphatic side chains). Of these possible substitutions, it is preferred that glycine 
and alanine are used to substitute for one another (since they have relatively short side chains) and that valine, 
leucine and isoleucine are used to substitute for one another (since they have larger aliphatic side chains 

which are hydrophobic). 

Other amino acids which can often be substituted for one another include: 
10 - phenylalanine, tyrosine and tryptophan (amino acids having aromatic side chains); 

- lysine, arginine and histidine (amino acids having basic side chains); 

- aspartate and glutamate (amino acids having acidic side chains); 

- asparagine and glutamine (amino acids having amide side chains); 

- cysteine and methionine (amino acids having sulphur-containing side chains); and 

15 - aspartic acid and glutamic acid can substitute for phospho-serine and phospho-threonine, 

respectively (amino acids with acidic side chains). 
Substitutions of this nature are often referred to as "conservative" or "semi-conservative" ammo acid 

substitutions. . . . . » 

Amino acid deletions or insertions may also be made relative to the ammo acid sequence given m a) 
20 above. Thus, for example, amino acids which do not have a substantial effect on the biological and/or 

immunological activity of the polypeptide, or at least which do not eliminate such activity, may be deleted. 
Such deletions can be advantageous since the overall length and the molecular weight of a polypeptide can be 
reduced whilst still retaining activity. This can enable the amount of polypeptide required for a particular 
purpose to be reduced for example, dosage levels can be reduced. 
25 " Amino acid insertions relative to the sequence given in a) above can also be made. This may be 
done to alter the properties of a polypeptide used in the present invention (e.g. to assist in identification, 
purification or expression, as explained above in relation to fusion proteins). 

Amino acid changes relative to the sequence given in a) above can be made using any suitable 
technique e.g. by using site-directed mutagenesis (Hutchinson et al., 1978, J. Biol. Chem. 253:6551). 
30 It should be appreciated that amino acid substitutions or insertions to the polypeptide for use m the 

present invention can be made using naturally occurring or non-naturally occurring amino acids. Whether or 
not natural or synthetic amino acids are used, it is preferred that only L-amino acids are present 

Whatever amino acid changes are made (whether by means of substitution, modification, 
insertion or deletion), preferred NKCC1 polypeptides have at least 50% sequence identity with a 
35 polypeptide as defined in a) above, more preferably the degree of sequence identity is at least 75%, at 
least 80%, at least 85%. Sequence identities of at least 90%, at least 95% or at least 98% are most 

preferred^ ^ ^ ^ ^ tQ describe ±e siiralar ity between two polypeptide sequences. The 

degree of amino acid sequence identity can be calculated using a program such as "bestfit" (Smith and 
40 Waterman, Advances in Applied Mathematics, 482-489 (1981)) to find the best segment of similarity 
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between any two sequences. The alignment is based on maximising the score achieved using a matrix of 
amino acid similarities, such as that described by Schwarz and Dayhof (1979) Adas of Protein Sequence 
and Structure, Dayhof, M.O., Ed pp 353-358. 

A software package well known in the art for carrying out this procedure is the CLUSTAL 
program. It compares the amino acid sequences of two polypeptides and finds the optimal alignment by 
inserting spaces in either sequence as appropriate. The amino acid identity or similarity (identity plus 
conservation of amino acid type) for an optimal alignment can also be calculated using a software 
package such as BLASTX. This program aligns the largest stretch of similar sequence and assigns a 
value to the fit For any one pattern comparison, several regions of similarity may be found, each having 
a different score. One skilled in the art will appreciate that two polypeptides of different lengths may be 
compared over the entire length of the longer fragment. Alternatively small regions may be compared. 
Normally sequences of the same length are compared for a useful comparison to be made. 

Where high degrees of sequence identity are present there will be relatively few differences in amino 
acid sequence. Thus for example they may be less than 20, less than 10, or even less than 5 differences. 

Polypeptides within the scone of c) 

As discussed supra, it is often advantageous to reduce the length of a polypeptide, preferably the 
resultant reduced length polypeptide still has a desired activity or can give rise to useful antibodies. Feature 
c) therefore covers fragments of polypeptides a) or b) above for use in the present invention. 

As used herein, the term "fragment" refers to a polypeptide comprising an amino acid sequence of 
at least 10 amino acid residues (preferably, at least 15 amino acid residues, at least 20 amino acid 
residues, at least 25 amino acid residues, at least 40 amino acid residues, at least 50 amino acid residues, 
at least 60 amino residues, at least 70 amino acid residues, at least 80 amino acid residues, at least 90 
amino acid residues, at least 100 amino acid residues, at least 125 amino acid residues, at least 150 amino 
acid residues, at least 175 amino acid residues, at least 200 amino acid residues, at least 250 amino acid 
residues, at least 300 amino acid residues, at least 350 amino acid residues, at least 400 amino acid 
residues, at least 450 amino acid residues, at least 500 amino acid residues, at least 550 amino acid 
residues, at least 600 amino acid residues, at least 650 amino acid residues, at least 700 amino acid 
residues, at least 750 amino acid residues, at least 800 amino acid residues, at least 850 amino acid 
residues, at least 900 amino acid residues, at least 950 amino acid residues, at least 1000 amino acid 
residues, at least 1050 amino acid residues, at least 1 100 amino acid residues, at least 1 150 amino acid 
residues or at least 1200 amino acid residues) of the amino acid sequence of a parent polypeptide. Any 
given fragment of the NKCC1 polypeptide may or may not possess the functional activity of the parent 
polypeptide. 

The skilled person can determine whether or not a particular fragment has activity. 

An NKCC1 polypeptide may be useful as antigenic material, and may be used in the production of 
vaccines for treatment or prophylaxis of breast, lung and/or pancreatic cancer. Such material can be 
"antigenic" and/or "immunogenic". Generally, "antigenic" is taken to mean that the protein is capable of 
being used to raise antibodies or indeed is capable of inducing an antibody response in a subject 
"Immunogenic" is taken to mean that the protein is capable of eliciting an immune response in a subject. 
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Thus, in the latter case, the protein may be capable of not only generating an antibody response but, in 
addition, non-antibody based immune responses. 

It is well known that is possible to screen an antigenic protein or polypeptide to identify epitopic 
regions, i.e. those regions which are responsible for the protein or polypeptide's antigenicity or 
immunogenicity. Methods well known to the skilled person can be used to test fragments and/or 
homologues and/or derivatives for antigenicity. Thus, the fragments for use in the present invention may 
include one or more such epitopic regions or be sufficiently similar to such regions to retain their 
antigenic/immunogenic properties. Thus, for fragments for use according to the present invention the degree 
of identity is perhaps irrelevant, since they may be 100% identical to a particular part of a protein or 
polypeptide, homologue or derivative as described herein. The key issue may be that the fragment retains the 
antigemc/immunogenic properties of the protein from which it is derived. 

Homologues, derivatives and fragments may possess at least a degree of the 
antigenicity/immunogenicity of the protein from which they are derived. 

As will be discussed below, NKCC1 polypeptides find use in an immunotherapeutic approach to 
breast, lung and/or pancreatic cancer. The skilled person will appreciate that for the preparation of one or 
more such polypeptides, the preferred approach will be based on recombinant DNA techniques. 

The NKCC1 polypeptides can be coded for by a large variety of nucleic acid molecules, taking into 
account the well-known degeneracy of the genetic code. All of these molecules can be used in the present 
invention. They can be inserted into vectors and cloned to provide large amounts of DNA or RNA for further 
study. Suitable vectors may be introduced into host cells to enable the expression of polypeptides used in the 
present invention using techniques known to the person skilled in the art. 

Techniques for cloning, expressing and purifying proteins and polypeptides are well known to the 
skilled person. DNA constructs can readily be generated using methods well known in the art These 
techniques are disclosed, for example in J. Sambrook et al, Molecular Cloning 2 nd Edition, Cold Spring 
Harbour Laboratory Press (1989); in Old & Primrose Principles of Gene Manipulation 5th Edition, 
Blackwell Scientific Publications (1994); and in Stryer Biochemistry 4th Edition, W H Freeman and 
Company (1995). Modifications of DNA constructs and the proteins expressed such as the addition of 
promoters, enhancers, signal sequences, leader sequences, translation start and stop signals and DNA 
stability controlling regions, or the addition of fusion partners may then be facilitated. 

Normally the DNA construct will be inserted into a vector, which may be of phage or plasmid 
origin. Expression of the protein is achieved by the transformation or transfection of the vector into a 
host cell, which may be of eukaryotic or prokaryotic origin. Such vectors and suitable host cells form 
further aspects for use in the present invention. 

The NKCC1 nucleic acids for use in the present invention, including DNA and RNA, and 
comprising a sequence encoding an NKCC1 polypeptide, may be synthesised using methods known in the 
art, such as using conventional chemical approaches or polymerase chain reaction (PCR) amplification. 
The NKCC1 nucleic acids also permit the identification and cloning of the gene encoding an NKCC1 
polypeptide from any species, for instance by screening cDNA libraries, genomic libraries or expression 
libraries. 
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Knowledge of the nucleic acid structure can be used to raise antibodies and for gene therapy. 
Techniques for this are well-known by those skilled in the art, as discussed in more detail herein. 

By using appropriate expression systems, polypeptides for use in the present invention may be 
expressed in glycosylated or non-glycosylated form Non-glycosylated forms can be pioduced by expression 

in prokaryotic hosts, such as E. coll 

Polypeptides comprising N-terminal methionine may be produced using certain expression systems, 

whilst mothers the mature polypeptide will lack this residue. 

Preferred techniques for cloning, expressing and purifying an NKCC1 polypeptide are summarised 



below: 



l o Polypeptides may be prepared natively or under denaturing conditions and then subsequently 

refolded. Baculoviral expression vectors include secretory plasmids (such as pACGP67 from 
Pharmingen), which may have an epitope tag sequence cloned in frame (e.g. myc, V5 or His) to aid 
detection and allow for subsequent purification of the protein. Mammalian expression vectors may 
include pCDNA3 and pSecTag (both Invitrogen), and pREP9 and pCEP4 (invitrogen). E. coli systems 
15 include the pBad series (His tagged -Invitrogen) or pGex series (Pharmacia). 

In addition to nucleic acid molecules coding for NKCC1 polypeptides, referred to herein as coding 
nucleic acids, the present invention also utilises nucleic acids complementary thereto. Thus, for example, 
both strands of a double stranded nucleic acid molecule are included in the present invention (whether or not 
they are associated with one another). Also included are mRNA molecules and complementary DNA 

20 molecules (eg.cDNA molecules). . ., , mu 

The term identity can also be used to describe the similarity between two individual DNA 
sequences: The 'bestfit' program (Smith and Waterman, Advances in applied Mathematics, 482-489 
(198 1)) is one example of a type of computer software used to find the best segment of similarity between 
two nucleic acid sequences, whilst the GAP program enables sequences to be aligned along their whole 

25 length and finds the optimal alignment by inserting spaces in either sequence as appropriate. It is 

preferred if sequences which show substantial identity with any of those of d), e) and f) have e.g. at least 
50%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95% or at least 98% sequence 

identity ^ ^ ^ mole cules which can hybridise to any of the nucleic acid molecules discussed 

30 above, in the diagnosis, screening, treatment and/or prophylaxis of breast, lung and/or pancreatic, cancer is 
also covered by the present invention. Such nucleic acid molecules are referred to herein as hybndismg 
nucleic acid molecules. Hybridising nucleic acid molecules can be useful as probes or primers, for example. 

Desirably such hybridising molecules are at least 8 nucleotides in length and preferably are at least 
25 or at least 50 nucleotides in length. The hybridising nucleic acid molecules preferably hybridise to nucleic 
35 acids within the scope of d),e),f),g) or h) above specifically. 

Desirably the hybridising molecules will hybridise to such molecules under stringent hybridisation 
conditions. One example of stringent hybridisation conditions is where attempted hybridisation is earned out 
at a temperature of from about 35'C to about 65'C using a salt solution that is about 0.9 molar. However, the 
skilled person will be able to vary such conditions as appropriate in order to take into account variables such 
40 as probelength, base composition, type of ions present, etc. For a high degree of selectivity, relatively 
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stringent conditions are used to form the duplexes, such as low salt or high temperature conditions. As 
used herein, "highly stringent conditions" means hybridisation to filter-bound DNA in 0.5 M NaHP0 4 , 
7% sodium dodecyl sulphate (SDS), 1 mM EDTA at 65°C, and washing in 0.1xSSC/0. 1% SDS at 68°C 
(Ausubel F.M. et al., eds., 1989, Current Protocols in Molecular Biology, Vol. I, Green Publishing 
5 Associates, Inc., and John Wiley & Sons, Inc., New York, at p. 2.10.3). For some applications, less 

stringent conditions for duplex formation are required. As used herein "moderately stringent conditions" 
means washing in 0.2xSSC/0.1% SDS at 42°C (Ausubel et al 9 1989, supra). Hybridisation conditions 
can also be rendered more stringent by the addition of increasing amounts of formamide, to destabilise the 
hybrid duplex. Thus, particular hybridisation conditions can be readily manipulated, and will generally 
10 be chosen depending on the desired results. In general, convenient hybridisation temperatures in the 

presence of 50% formamide are: 42°C for a probe which is 95 to 100% identical to the fragment of a gene 
encoding a polypeptide as defined herein, 37°C for 90 to 95% identity and 32°C for 70 to 90% identity. 
In the preparation of genomic libraries, DNA fragments are generated, some of which will encode parts or 
the whole of an NKCC1 polypeptide. The DNA may be cleaved at specific sites using various restriction 
15 enzymes. Alternatively, one may use DNAse in the presence of manganese to fragment the DNA, or the 
DNA can be physically sheared, as for example, by sonication. The DNA fragments can then be 
separated according to size by standard techniques, including but not limited to agarose and 
polyacrylamide gel electrophoresis, column chromatography and sucrose gradient centrifugation. The 
DNA fragments can then be inserted into suitable vectors, including but not limited to plasmids, cosmids, 
20 bacteriophages lambda or T 4 , and yeast artificial chromosomes (YACs). (See, for example, Sambrook et 
oL, 1989, Molecular Cloning, A Laboratory Manual, ID Ed., Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, New York; Glover, D.M. (ed.), 1985, DNA Cloning: A Practical Approach, MRL Press, 
Ltd., Oxford, U.K. Vol. I, H; Ausubel F.M. et al. 9 eds., 1989, Current Protocols in Molecular Biology, 
Vol. I, Green Publishing Associates, Inc., and John Wiley & sons, Inc., New York). The genomic library 
25 may be screened by nucleic acid hybridisation to labelled probe (Benton & Davis, 1977, Science 196: 180; 
Grunstein & Hogness, 1975, Proc. Natl Acad. Set U.S.A. 72:3961). 

Manipulation of the DNA encoding a protein is a particularly powerful technique for both 
modifying proteins and for generating large quantities of protein for purification purposes. This may 
involve the use of PCR techniques to amplify a desired nucleic acid sequence. Thus the sequence data 
30 provided herein can be used to design primers for use in PCR so that a desired sequence can be targeted and 
then amplified to a high degree. 

Typically, primers will be at least eight nucleotides long and will generally be at least ten nucleotides 
long (e.g. fifteen to twenty-five nucleotides long). In some cases, primers of at least thirty or at least thirty- 
five nucleotides in length may be used. 
35 As a further alternative chemical synthesis may be used. This may be automated. Relatively short 

sequences may be chemically synthesised and ligated together to provide a longer sequence. 

In addition to being used as primers and/or probes, hybridising nucleic acid molecules can be used as 
anti-sense molecules to alter the expression of NKCC1 polypeptides by binding to complementary nucleic 
acid molecules. This technique can be used in anti-sense therapy. 
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As used herein, an "antisense" nucleic acid refers to a nucleic acid capable of hybridising by 
virtue of some sequence complementarity to a portion of an RNA (preferably mRNA) encoding a 
polypeptide as defined herein. The antisense nucleic acid may be complementary to a coding and/or non- 
coding region of a mRNA encoding such a polypeptide. Such antisense nucleic acids have utility as 
5 compounds that inhibit expression, and can be used in the treatment or prevention of breast, lung and/or 
pancreatic cancer. 

In a specific embodiment, expression of an NKCC1 polypeptide is inhibited by use of antisense 
nucleic acids. The present invention provides the therapeutic or prophylactic use of nucleic acids 
comprising at least eight nucleotides that are antisense to a gene or cDNA encoding an NKCC1 
10 polypeptide. 

A hybridising nucleic acid molecule of use in the present invention may have a high degree ot 
sequence identity along its length with a nucleic acid molecule within the scope of d)-i) above (e.g. at least 
50%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95% or at least 98% sequence identity). 
As will be appreciated by the skilled person, the higher the sequence identity a given single stranded nucleic 
15 acid molecule has with another nucleic acid molecule, the greater the likelihood that it will hybridise to a 
nucleic acid molecule which is complementary to that other nucleic acid molecule under appropriate 
conditions. 

The term 'RNA equivalent' when used above indicates that a given RNA molecule has a 
sequence which is complementary to that of a given DNA molecule, allowing for the fact that in RNA «U' 
20 replaces T* in the genetic code. The nucleic acid molecule may be in isolated, recombinant or chemically 
synthetic form. 

In view of the foregoing description the skilled person will appreciate that utilisation of a large 
number of nucleic acids are within the scope of the present invention. . Unless the context indicates 
otherwise, the term "NKCC1 nucleic acid" includes those nucleic acid molecules defined in d) to i) above 
25 and may have one or more of the following characteristics: 

1) they may be provided in recombinant form, e.g. covalently linked to a 5" and/or a 3' flanking 
sequence to provide a molecule which does not occur in nature; 

2) they may be provided without 5' and/or 3' flanking sequences which normally occur in 
nature; and 

30 3 ) they may be provided in substantially pure form. Thus they may be provided in a form 

which is substantially free from contaminating proteins and/or from other nucleic acids; 
If desired, a gene encoding an NKCC1 polypeptide, a related gene, or related nucleic acid 
sequences or subsequences, including complementary sequences, can also be used in hybridisation assays. 
A nucleotide encoding an NKCC1 polypeptide, or subsequences thereof comprising at least 8 nucleotides, 
35 can be used as a hybridisation probe. Hybridisation assays can be used for detection, prognosis, 

diagnosis, or monitoring of conditions, disorders, or disease states, associated with aberrant expression of 
genes encoding an NKCC1 polypeptide, or for differential diagnosis of patients with signs or symptoms 
suggestive of breast, lung and/or pancreatic cancer. In particular, such a hybridisation assay can be 
carried out by a method comprising contacting a patient sample containing nucleic acid with a nucleic 
40 acid probe capable of hybridising to a DNA or RNA that encodes an NKCC1 polypeptide, under 
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conditions such that hybridisation can occur, and detecting or measuring any resulting hybridisation. 
Nucleotides can be used for therapy of patients having breast, lung and/or pancreatic cancer, as described 
below. 

In another embodiment, a preparation of oligonucleotides comprising 10 or more consecutive 
nucleotides complementary to a nucleotide sequence encoding an NKCC1 polypeptide for use as vaccines 
for the treatment of breast, lung and/or pancreatic cancer. Such preparations may include adjuvants or 
other vehicles. 

In a specific embodiment, nucleic acids comprising a sequence encoding an NKCC1 polypeptide, 
are administered to promote polypeptide function by way of gene therapy. Gene therapy refers to 
administration to a subject of an expressed or expressible nucleic acid. In this embodiment, the nucleic 
acid produces its encoded protein that mediates a therapeutic effect by promoting polypeptide function. 
Any of the methods for gene therapy available in the art can be used according to the present invention. 

In a preferred aspect, the compound comprises an NKCC1 nucleic acid, such as a nucleic acid 
encoding an NKCC1 polypeptide, said nucleic acid being part of an expression vector that expresses an 
NKCC1 polypeptide or chimeric protein thereof in a suitable host. In particular, such a nucleic acid has a 
promoter operably linked to the polypeptide coding region, said promoter being inducible or constitutive 
(and, optionally, tissue-specific). In another particular embodiment, a nucleic acid molecule is used in 
which the coding sequences and any other desired sequences are flanked by regions that promote 
homologous recombination at a desired site in the genome, thus providing for intrachromosomal 
expression of the nucleic acid (Koller & Smithies, 1989, Proc. Natl. Acad. Sci. USA 86:8932-8935; 
Zijlstra et al., 1989, Nature 342:435-438). 

Delivery of the nucleic acid into a patient may be direct, in which case the patient is directly 
exposed to the nucleic acid or nucleic acid-carrying vector; this approach is known as in vivo gene 
therapy. Alternatively, delivery of the nucleic acid into the patient may be indirect, in which case cells are 
first transformed with the nucleic acid in vitro and then transplanted into the patient; this approach is 

known as ex vivo gene therapy. 

A convenient means for detecting/quantifying the NKCC1 polypeptides involves the use of 
antibodies. Thus, the NKCC1 polypeptides also find use in raising antibodies. Thus, in a further aspect, the 
present invention provides the use of an antibody which binds to at least one NKCC1 polypeptide for 
screening for and/or diagnosis of breast, lung and/or pancreatic cancer in a subject Preferably, the antibody 
is used for detecting and/or quantifying the amount of an NKCC1 polypeptide in a biological sample obtained 
from said subject. 

In one embodiment, binding of antibody in tissue sections can be used to detect aberrant 
polypeptide localisation or an aberrant level of polypeptide. In a specific embodiment, an antibody to an 
NKCC1 polypeptide can be used to assay a patient tissue (e.g. a breast biopsy) for the level of the 
polypeptide where an aberrant level of polypeptide is indicative of breast, lung and/or pancreatic cancer. 
As used herein, an "aberrant level" means a level that is increased or decreased compared with the level in 
a subject free from breast, lung and/or pancreatic cancer or a reference level. 
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Suitable immunoassays include, without limitation, competitive and non-competitive assay 
systems using techniques such as western blots, radioimmunoassays, ELISA (enzyme linked 
immunosorbent assay), "sandwich" immunoassays, immunoprecipitation assays, precipitin reactions, gel 
diffusion precipitin reactions, immunodiffusion assays, agglutination assays, complement-fixation assays, 
immunoradiometric assays, fluorescent immunoassays and protein A immunoassays. 

NKCC1 polypeptides, may be used as immunogens to generate antibodies which 
immunospecifically bind such an immunogen, these are referred to herein as NKCC1 antibodies. NKCC1 
antibodies include, but are not limited to polyclonal, monoclonal, bispecific, humanised or chimeric 
antibodies, single chain antibodies, Fab fragments and F(ab') fragments, fragments produced by a Fab 
expression library, anti-idiotypic (anti-Id) antibodies, and epitope-binding fragments of any of the above. 
The term "antibody" as used herein refers to immunoglobulin molecules and immunologically-active 
portions of immunoglobulin molecules, i.e. molecules that contain an antigen binding site that specifically 
binds an antigen. The immunoglobulin molecules of the invention can be of any class (e.g. IgG, IgE, 
IgM, IgD and IgA) or subclass of immunoglobulin molecule. Preferred antibodies bind specifically to 
NKCC1 polypeptides so that they can be used to purify and/or inhibit the activity of such polypeptides. 

In the production of antibodies, screening for the desired antibody can be accomplished by 
techniques known in the art, e.g. ELISA (enzyme-linked immunosorbent assay). For example, to select 
antibodies, which recognise a specific domain of a polypeptide used in the invention, one may assay 
generated hybridomas for a product which binds to a polypeptide fragment containing such domain. For 
selection of an antibody that specifically binds a first polypeptide homologue but which does not 
specifically bind to (or binds less avidly to) a second polypeptide homologue, one can select on the basis 
of positive binding to the first polypeptide homologue and a lack of binding to (or reduced binding to) the 
second polypeptide homologue. 

For preparation of NKCC1 monoclonal antibodies (mAbs), any technique which provides for the 
production of antibody molecules by continuous cell lines in culture may be used. For example, the 
hybridoma technique originally developed by Kohler and Milstein (1975, Nature 256:495^97), as well as 
the trioma technique, the human B-cell hybridoma technique (Kozbor et al., 1983, Immunology Today 
4:72), and the EBV-hybridoma technique to produce human monoclonal antibodies (Cole et al., 1985, in 
Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). Such antibodies may be of 
any immunoglobulin class including IgG, IgM, IgE, IgA, IgD and any subclass thereof. The hybridoma 
producing the mAbs in the invention may be cultivated in vitro or in vivo. In an additional embodiment 
of the invention, mAbs can be produced in germ-free animals utilising known technology. 

The mAbs include but are not limited to human mAbs and chimeric mAbs (e.g. human-mouse 
chimeras). A chimeric antibody is a molecule in which different portions are derived from different 
animal species, such as those having a human immunoglobulin constant region and a variable region 
derived from a murine mAb. (See, e.g. U.S. Patent No. 4,816,567; and U.S. Patent No. 4,816397). 
Humanised antibodies are antibody molecules from non-human species having one or more 
complementarity determining regions (CDRs) from the non-human species and a framework region from 
a human immunoglobulin molecule. (See, e.g. U.S. Patent No. 5,585,089). 
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Chimeric and humanised mAbs can be produced by recombinant DNA techniques known in the 
art, for example using methods described in WO 87/02671; EP 184,187; EP 171,496; EP 173,494;. WO 
86/01533; U.S. 4,816,567; EP 125,023; Better et al., 1988, Science 240: 1041-1043; Liu et al., 1987, Proc. 
Natl Acad. Sci. USA 84:3439-3443; Liu et al., 1987, J. Immunol. 139:3521-3526; Sun et al, 1987, Proc. 
5 Natl Acad. Sci. USA 84:214-218; Nishimura et al., 1987, Cane. Res. 47:999-1005; Wood et al., 1985, 
Nature 314:446-449; Shaw et al., 1988, J. Natl. Cancer Inst. 80: 1553-1559; Morrison, 1985, Science 
229- 1202-1207; Oi et al., 1986, Bio/Techniques 4:214; U.S. Patent 5,225,539; Jones et al., 1986, Nature 
321:552-525; Verhoeyan era/. (1988) Science 239:1534; and Beidler aZ., 1988, J. Immunol. 
141:4053^060. 

10 Completely human antibodies are particularly desirable for therapeutic treatment of human 

patients. Such antibodies can be produced using transgenic mice which are incapable of expressing 
endogenous immunoglobuUn heavy and light chain genes, but which can express human heavy and light 
chain genes. The transgenic mice are immunised in the normal fashion with a selected antigen, e.g. all or 
a portion of a polypeptide for use in the invention. mAbs directed against the antigen can be obtained 
15 using conventional hybridoma technology. The human immunoglobulin transgenes harboured by the 
transgenic mice rearrange during B cell differentiation, and subsequently undergo class switching and 
somatic mutation. Thus, using such a technique, it is possible to produce therapeutically useful IgG, IgA, 
IgM IgD and IgE antibodies. For an overview of this technology for producing human antibodies, see 
Lonberg andHuszar (1995, Int. Rev. Immunol. 13:65-93). For a detailed discussion of this technology 
20 for producing human antibodies and human monoclonal antibodies and protocols for producing such 
antibodies, see, e.g. U.S. Patent 5,625,126; U.S. Patent 5,633,425; U.S. Patent 5,569,825; U.S. Patent 
5,661,016; and U.S. Patent 5,545,806. 

Completely human antibodies, which recognise a selected epitope, can be generated using a 
technique referred to as "guided selection." In this approach a selected non-human monoclonal antibody, 
25 e.g. a mouse antibody, is used to guide the selection of a completely human antibody recognising the 
same epitope. (Jespers et al. (1994) Bio/technology 12:899-903). 

The NKCC1 antibodies can also be generated using various phage display methods known in the 
art. In phage display methods, functional antibody domains are displayed on the surface of phage 
particles which carry the polynucleotide sequences encoding them. In a particular, such phage can be 
30 utilised to display antigen binding domains expressed from a repertoire or combinatorial antibody library 
(e.g. human or murine). Phage expressing an antigen binding domain that binds the antigen of interest 
can be selected or identified with antigen, e.g. using labelled antigen or antigen bound or captured to a 
solid surface or bead. Phage used in these methods are typically filamentous phage including fd and M13 
binding domains expressed from phage with Fab, Fv or disulphide stabilised Fv antibody domains 
35 recombinant* fused to either the phage gene D3 or gene VD1 protein. Phage display methods that can be 
used to make the antibodies of the present invention include those disclosed in Bnnkman et al., J. 
Immunol. Methods 182: 41-50 (1995); Ames et al., J. Immunol. Methods 184:177-186 (1995); 
Kettleborough et aCiur. j. Immunol. 24:952-958 (1994); Persic et al., Gene 187 9-18 (1997); Burton et 
al., Advances in Immunology 57:191-280 (1994);. WO92/01047; WO 90/02809; WO 91/10737; WO 
40 92/01047- WO 92/18619; WO 93/1 1236; WO 95/15982; WO 95/20401; and U.S. Patent Nos. 5,698,426; 
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5,223,409; 5,403,484; 5,580,717; 5,427,908; 5,750,753; 5,821,047; 5,571,698; 5,427,908; 5,516,637; 
5,780,225; 5,658,727; 5,733,743 and 5,969,108. 

As described in the above references, after phage selection, the antibody coding regions from the 
phage can be isolated and used to generate whole antibodies, including human antibodies, or any other 

5 desired antigen binding fragment, and expressed in any desired host, including mammalian cells, insect 
cells, plant cells, yeast, and bacteria, e.g. as described in detail below. For example, techniques to 
recombinant produce Fab, Fab 1 and F(ab*)2 fragments can also be employed using methods known in 
the art such as those disclosed in WO 92/22324; Mullinax et al., BioTechniques 12(6):864-869 (1992); 
and Sawai et al., AJRI 34:26-34 (1995); and Better et al., Science 240:1041-1043 (1988), 

10 Examples of techniques which can be used to produce single-chain Fvs and antibodies include 

those described in U.S. Patents 4,946,778 and 5,258,498; Huston et a/., Methods in Enzymology 203:46- 
88 (1991); Shu et al, PNAS 90:7995-7999 (1993); and SkeiTa et a/., Science 240:1038-1040 (1988). 

The invention further provides for the use of bispecific antibodies, which can be made by 
methods known in the art. Traditional production of full length bispecific antibodies is based on the 

15 coexpression of two immunoglobulin heavy chain-light chain pairs, where the two chains have different 
specificities (Milstein et al., 1983, Nature 305:537-539). Because of the random assortment of 
immunoglobulin heavy and light chains, these hybridomas (quadromas) produce a potential mixture of 10 
different antibody molecules, of which only one has the correct bispecific structure. Purification of the 
correct molecule, which is usually done by affinity chromatography steps, is rather cumbersome, and the 

20 product yields are low. Similar procedures are disclosed in WO 93/08829, and in Traunecker et al., 1991, 
EMBO J. 10:3655-3659. 

According to a different and more preferred approach, antibody variable domains with the desired 
binding specificities (antibody-antigen combining sites) are fused to immunoglobulin constant domain 
sequences. The fusion preferably is with an immunoglobulin heavy chain constant domain, comprising at 

25 least part of the hinge, CH2, and CH3 regions. It is preferred to have the first heavy-chain constant region 
(CHI) containing the site necessary for light chain binding, present in at least one of the fusions. DNAs 
encoding the immunoglobulin heavy chain fusions and, if desired, the immunoglobulin light chain, are 
inserted into separate expression vectors, and are co-transfected into a suitable host organism. This 
provides for great flexibility in adjusting the mutual proportions of the three polypeptide fragments in 

30 embodiments when unequal ratios of the three polypeptide chains used in the construction provide the 
optimum yields. It is, however, possible to insert the coding sequences for two or all three polypeptide 
chains in one expression vector when the expression of at least two polypeptide chains in equal ratios 
results in high yields or when the ratios are of no particular significance. 

In a preferred embodiment of this approach, the bispecific antibodies are composed of a hybrid 

35 immunoglobulin heavy chain with a first binding specificity in one arm, and a hybrid immunoglobulin 
heavy chain-light chain pair (providing a second binding specificity) in the other arm. It was found that 
this asymmetric structure facilitates the separation of the desired bispecific compound from unwanted 
immunoglobulin chain combinations, as the presence of an immunoglobulin light chain in only one half 
of the bispecific molecule provides for a facile way of separation. This approach is disclosed in WO 
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94/04690. For father details for generating bispecific antibodies see, for example, Suiesh et al., Methods 
inEnzymology, 1986, 121:210. 

The invention provides functionally-active fragments, derivatives or analogues of the anti- 
polypeptide immunoglobulin molecules. "Functionally-active" means that the fragment, derivative or 
5 analogue is able to elicit anti-anti-idiotype antibodies (i.e., tertiary antibodies) that recognise the same 
antigen that is recognised by the antibody from which the fragment, derivative or analogue is derived. 
Specifically, in a preferred embodiment, the antigenicity of the idiotype of the immunoglobulin molecule 
may be enhanced by deletion of framework and CDR sequences that are C-terminal to the CDR sequence 
that specifically recognises the antigen. To determine which CDR sequences bind the antigen, synthetic 
10 peptides containing the CDR sequences can be used in binding assays with the antigen by any binding 
assay method known in the art. 

The present invention provides antibody fragments such as, but not limited to, F(ab')2 fragments 
and Fab fragments. Antibody fragments which recognise specific epitopes may be generated by known 
techniques. F(ab , )2 fragments consist of the variable region, the light chain constant region and the CHI 
15 domain of the heavy chain and are generated by pepsin digestion of the antibody molecule. Fab 

fragments are generated by reducing the disulphide bridges of the F(ab')2 fragments. The invention also 
provides heavy chain and light chain dimers of the NKCC1 antibodies, or any minimal fragment thereof 
such as Fvs or single chain antibodies (SCAs) (e.g. as described in U.S. Patent 4,946,778; Bird, 1988, 
Science 242:423-42; Huston et al., 1988, Proc. Natl. Acad. Sci. USA 85:5879-5883; and Ward et al., 
20 1989, Nature 334:544-54), or any other molecule with the same specificity as the NKCC1 antibody. 

Single chain antibodies are formed by linking the heavy and light chain fragments of the Fv region via an 
amino acid bridge, resulting in a single chain polypeptide. Techniques for the assembly of functional Fv 
fragments in E. coli may be used (Skerra et al., 1988, Science 242:1038-1041). 

In other embodiments, the invention provides fusion proteins of the NKCC1 antibodies (or 
25 functionally active fragments thereof), for example in which the antibody is fused via a covalent bond 
(e.g. a peptide bond), at either the N-terminus or the C-terminus to an amino acid sequence of another 
protein (or portion thereof, preferably at least 10, 20 or 50 amino acid portion of the protein) that is not 
the antibody. Preferably the antibody, or fragment thereof, is covalently linked to the other protein at the 
N-terminus of the constant domain. As stated above, such fusion proteins may facilitate purification, 
30 increase half-life in vivo, and enhance the delivery of an antigen across an epithelial barrier to the 
immune system. 

The NKCC1 antibodies include analogues and derivatives that are either modified, i.e., by the 
covalent attachment of any type of molecule as long as such covalent attachment that does not impair 
immunospecific binding. For example, but not by way of limitation, the derivatives and analogues of the 

35 immunoglobulins include those that have been further modified, e.g. by glycosylation, acetylation, 

pegylation, phosphylation, amidation, derivatisation by known protecting/blocking groups, proteolytic 
cleavage, linkage to a cellular ligand or other protein, etc. Any of numerous chemical modifications may 
be carried out by known techniques, including, but not limited to specific chemical cleavage, acetylation, 
formylation, etc. Additionally, the analogue or derivative may contain one or more non-classical amino 

40 acids. 
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The foregoing antibodies can be used in methods known in the art relating to the localisation and 
activity of the polypeptides used in the invention, e.g. for imaging or radioimaging these proteins, 
measuring levels thereof in appropriate physiological samples, in diagnostic methods, etc. and for 
radiotherapy. . 

5 The NKCC1 antibodies can be produced by any method known in the art for the synthesis of 

antibodies, in particular, by chemical synthesis or by recombinant expression, and are preferably 
produced by recombinant expression technique. 

Recombinant expression of antibodies, or fragments, derivatives or analogues thereof, requires 
construction of a nucleic acid that encodes the antibody. If the nucleotide sequence of the antibody is 

10 known, a nucleic acid encoding the antibody may be assembled from chemically synthesised 

oligonucleotides (e.g. as described in Kutmeier et aU 1994, BioTechniques 17:242), which, briefly, 
involves the synthesis of overlapping oligonucleotides containing portions of the sequence encoding 
antibody, annealing and ligation of those oligonucleotides, and then amplification of the ligated 
oligonucleotides by PCR. 

15 Alternatively, the nucleic acid encoding the antibody may be obtained by cloning the antibody. If 

a clone containing the nucleic acid encoding the particular antibody is not available, but the sequence of 
the antibody molecule is known, a nucleic acid encoding the antibody may be obtained from a suitable 
source (e.g. an antibody cDNA library, or cDNA library generated from any tissue or cells expressing the 
antibody) by PCR amplification using synthetic primers hybridisable to the 3' and 5* ends of the sequence 
20 or by cloning using an oligonucleotide probe specific for the particular gene sequence. 

If an antibody molecule that specifically recognises a particular antigen is not available (or a 
source for a cDNA library for cloning a nucleic acid encoding such an antibody), antibodies specific for a 
particular antigen may be generated by any method known in the art, for example, by immunising an 
animal, such as a rabbit, to generate polyclonal antibodies or, more preferably, by generating monoclonal 
25 antibodies. Alternatively, a clone encoding at least the Fab portion of the antibody may be obtained by 
screening Fab expression libraries (e.g. as described in Huse et al 9 1989, Science 246:1275-1281) for 
clones of Fab fragments that bind the specific antigen or by screening antibody libraries (See, e.g. 
Clackson et aU 1991, Nature 352:624; Hane et cd., 1997 Proc. Natl. Acad. Sci. USA 94:4937). 

Once a nucleic acid encoding at least the variable domain of the antibody molecule is obtained, it 
30 may be introduced into a vector containing the nucleotide sequence encoding the constant region of the 
antibody molecule (see, e.g. WO 86/05807; WO 89/01036; and U.S. 5,122,464). Vectors containing the 
complete light or heavy chain for co-expression with the nucleic acid to allow the expression of a 
complete antibody molecule are also available. Then, the nucleic acid encoding the antibody can be used 
to introduce the nucleotide substitution(s) or deletion(s) necessary to substitute (or delete) the one or more 
35 variable region cysteine residues participating in an intrachain disulphide bond with an amino acid residue 
that does not contain a sulphydryl group. Such modifications can be carried out by any method known in 
the art for the introduction of specific mutations or deletions in a nucleotide sequence, for example, but 
not limited to, chemical mutagenesis, in vitro site directed mutagenesis (Hutchinson et al., 1978, J. Biol. 
Chem. 253:6551), PCR based methods, etc. 
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In addition, techniques developed for the production of "chimeric antibodies" (Morrison et al., 

1984, Proc: Natl. Acad. Sci. 81:851-855; Neuberger et al., 1984, Nature 312:604-608; Takeda et al., 

1985, Nature 3 14:452-454) by splicing genes from a mouse antibody molecule of appropriate antigen 
specificity together with genes from a human antibody molecule of appropriate biological activity can be 

5 used. As described supra, a chimeric antibody is a molecule in which different portions are derived from 
different animal species, such as those having a variable region derived from a murine mAb and a human 
antibody constant region, e.g. humanised antibodies. 

Once a nucleic acid encoding an antibody molecule has been obtained, the vector for the 
production of the antibody molecule may be produced by recombinant DNA technology using techniques 
10 well known in the art. Thus, methods for preparing the polypeptides used in the invention by expressing 
nucleic acid containing the antibody molecule sequences are described herein. Methods which are well 
known to those skilled in the art can be used to construct expression vectors containing an antibody 
molecule coding sequences and appropriate transcriptional and translational control signals. These 
methods include, for example, in vitro recombinant DNA techniques, synthetic techniques, and in vivo 
15 genetic recombination. See, for example, the techniques described in Sambrook et al. (1990, Molecular 
Cloning, A Laboratory Manual, 2d Ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, NY) and 
Ausubel et al. (eds., 1998, Current Protocols in Molecular Biology, J ohn WUey & Sons, NY). 

The expression vector is transferred to a host cell by conventional techniques and the transfected 
cells are then cultured by conventional techniques to produce an NKCC1 antibody. 
20 The host cells used to express a recombinant NKCC1 antibody may be either bacterial cells such 

as Escherichia coli, or, preferably, eukaryotic cells, especially for the expression of whole recombinant 
antibody molecule. In particular, mammalian cells such as Chinese hamster ovary cells (CHO), in 
conjunction with a vector such as the major intermediate early gene promoter element from human 
cytomegalovirus is an effective expression system for antibodies (Foecking et al., 1986, Gene 45:101; 
25 Cockett et al, 1990, BioTechnology 8:2). 

A variety of host-expression vector systems may be utilised to express an NKCC1 antibody 
molecule. Such host-expression systems represent vehicles by which the coding sequences of interest 
may be produced and subsequently purified, but also represent cells which may, when transformed or 
transfected with the appropriate nucleotide coding sequences, express the NKCC1 antibody in situ. These 
30 include but are not limited to microorganisms such as bacteria (e.g. E. coli, B. subtilis) transformed with 
recombinant bacteriophage DNA, plasmid DNA or cosmid DNA expression vectors containing antibody 
coding sequences; yeast (e.g. Saccharomyces, Pichia) transformed with recombinant yeast expression 
vectors containing antibody coding sequences; insect cell systems infected with recombinant virus 
expression vectors (e.g. baculovirus) containing the antibody coding sequences; plant cell systems 
35 infected with recombinant virus expression vectors (e.g. cauliflower mosaic virus, CaMV; tobacco mosaic 
virus, TMV) or transformed with recombinant plasmid expression vectors (e.g. Ti plasmid) containing 
antibody coding sequences; or mammalian cell systems (e.g. COS, CHO, BHK, HEK 293, 3T3 cells) 
harbouring recombinant expression constructs containing promoters derived from the genome of 
mammalian cells (e.g. metallothionein promoter) or from mammalian viruses (e.g. the adenovirus late 
40 promoter, the vaccinia virus 7.5K promoter). 
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In bacterial systems, a number of expression vectors may be advantageously selected depending 
upon the use intended for the antibody molecule being expressed. For example, when a large quantity of 
such a protein is to be produced, for the generation of pharmaceutical compositions comprising an 
antibody molecule, vectors which direct the expression of high levels of fusion protein products that are 
readily purified may be desirable. Such vectors include, but are not limited, to the E. coli expression 
vector pUR278 (Ruther et al., 1983, EMBO J. 2: 1791), in which the antibody coding sequence may be 
ligated individually into the vector in frame with the lac Z coding region so that a fusion protein is 
produced; pIN vectors (Inouye & Inouye, 1985, Nucleic Acids Res. 13:3101-3109; Van Heeke & 
Schuster, 1989, J. Biol. Chem. 24:5503-5509); and the like. pGEX vectors may also be used to express 
foreign polypeptides as fusion proteins with glutathione S-transferase (GST). In general, such fusion 
proteins are soluble and can easily be purified from lysed cells by adsorption and binding to a matrix 
" glutathione-agarose beads followed by elution in the presence of free glutathione. The pGEX vectors are 
designed to include thrombin or factor Xa protease cleavage sites so that the cloned target gene product 
can be released from the GST moiety. 
15 In an insect system, Autographa californica nuclear polyhedrosis virus (AcNPV) is used as a. 

vector to express foreign genes. The virus grows in Spodopterafrugiperda cells. The antibody coding 
sequence may be cloned individually into non-essential regions (for example, the polyhedrin gene) of the 
virus and placed under control of an AcNPV promoter (for example, the polyhedrin promoter). In 
mammalian host cells, a number of viral-based expression systems (e.g. an adenovirus expression system) 
20 may be utilised. 

As discussed above, a host cell strain may be chosen which modulates the expression of the 
inserted sequences, or modifies and processes the gene product in the specific fashion desired. Such 
modifications (e.g. glycosylation) and processing (e.g. cleavage) of protein products may be important for 
the activity of the protein. 

25 For long-term, high-yield production of recombinant antibodies, stable expression is preferred. 

For example, cells lines that stably express an antibody of interest can be produced by transfecting the 
cells with an expression vector comprising the nucleotide sequence of the antibody and the nucleotide 
sequence of a selectable (e.g. neomycin or hygromycin), and selecting for expression of the selectable 
marker. Such engineered cell lines may be particularly useful in screening and evaluation of compounds 

30 that interact directly or indirectly with the antibody molecule. 

The expression levels of the antibody molecule can be increased by vector amplification (for a 
review, see Bebbington and Hentschel, The use of vectors based on gene amplification for the expression 
of cloned genes in mammalian cells in DNA cloning, Vol.3. (Academic Press, New York, 1987)). When 
a marker in the vector system expressing antibody is amplifiable, increase in the level of inhibitor present 

35 in culture of host cell will increase the number of copies of the marker gene. Since the amplified region is 
associated with the antibody gene, production of the antibody will also increase (Crouse et al., 1983, Mol. 
Cell. Biol. 3:257). 

The host cell may be co-transfected with two expression vectors for use within the invention, the 
first vector encoding a heavy chain derived polypeptide and the second vector encoding a tight chain 
40 derived polypeptide. The two vectors may contain identical selectable markers which enable equal 
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expression of heavy and light chain polypeptides. Alternatively, a single vector may be used which 
encodes both heavy and light chain polypeptides. In such situations, the light chain should be placed 
before the heavy chain to avoid an excess of toxic free heavy chain (Proudfoot, 1986, Nature 322:52; 
Kohler, 1980, Proc. Natl. Acad. Sci. USA 77:2197). The coding sequences for the heavy and light chains 
may comprise cDNA or genomic DNA. 

Once the NKCC1 antibody has been recombinantly expressed, it may be purified by any method 
known in the art for purification of an antibody molecule, for example, by chromatography (e.g. ion 
exchange chromatography, affinity chromatography such as with protein A or specific antigen, and sizing 
column chromatography), centrifugation, differential solubility, or by any other standard technique for the 
purification of proteins. 

Alternatively, any fusion protein may be readily purified by utilising an antibody specific for the 
fusion protein being expressed. For example, a system described by Janknecht et al. allows for the ready 
purification of non-denatured fusion proteins expressed in human cell lines (Janknecht et aZ., 1991, Proc. 
Natl. Acad. Sci. USA 88:8972-897). In this system, the gene of interest is subcloned into a vaccinia 
recombination plasmid such that the open reading frame of the gene is translationally fused to an amino- 
terminal tag consisting of six histidine residues. The tag serves as a matrix binding domain for the fusion 
protein. Extracts from cells infected with recombinant vaccinia virus are loaded onto Ni 2+ nitriloacetic 
acid-agarose columns and histidine-tagged proteins are selectively eluted with imidazole-containing 
buffers. 

In a preferred embodiment, NKCC1 antibodies are conjugated to a diagnostic or therapeutic 
moiety. The antibodies can be used for diagnosis or to determine the efficacy of a given treatment 
regimen. Detection can be facilitated by coupling the antibody to a detectable substance. Examples of 
detectable substances include various enzymes, prosthetic groups, fluorescent materials, luminescent 
materials, bioluminescent materials, radioactive nuclides, positron emitting metals (for use in positron 
emission tomography), and nonradioactive paramagnetic metal ions. See generally U.S. Patent No. 
4,741,900 for metal ions which can be conjugated to antibodies for use as diagnostics according to the 
present invention. Suitable enzymes include horseradish peroxidase, alkaline phosphatase, beta- 
galactosidase, or acetylcholinesterase; suitable prosthetic groups include streptavidin, avidin and biotin; 
suitable fluorescent materials include umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, 
dichlorotriazinylamine fluorescein, dansyl chloride and phycoerythrin; suitable luminescent materials 
include luminol; suitable bioluminescent materials include luciferase, luciferin, and aequorin; and suitable 
radioactive nuclides include 125 1, 131 I, ni In, and 99 Tc. 

NKCC1 antibodies can be conjugated to a therapeutic agent or drug moiety to modify a given 
biological response. The therapeutic agent or drug moiety is not to be construed as limited to classical 
chemical therapeutic agents. For example, the drug moiety may be a protein or polypeptide possessing a 
desired biological activity. Such proteins may include, for example, a toxin such as abrin, ricin A, 
pseudomonas exotoxin, or diphtheria toxin; a protein such as tumour necrosis factor, oc-interferon, fi- 
interferon, nerve growth factor, platelet derived growth factor, tissue plasminogen activator, a thrombotic 
agent or an anti-angiogenic agent, e.g. angiostatin or endostatin; or, a biological response modifier such as 
a lymphokine, interleukin-1 (IL-1), interleukin-2 (IL-2), interleukin-6 (IL-6), granulocyte macrophage 
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colony stimulating factor (GM-CSF), granulocyte colony stimulating factor (G-CSF), nerve growth factor 

(NGF) or other growth factor. 

Techniques for conjugating such therapeutic moiety to antibodies are well known, see, e.g. Amon 
et al "Monoclonal Antibodies For Immunotargeting Of Drugs In Cancer Therapy", in Monoclonal 
5 Antibodies And Cancer Therapy, Reisfeld et al. (eds.), PP- 243-56 (Alan R. Liss, Inc. 1985); Hellstrom et 
al "Antibodies For Drug Delivery", in Controlled Drug Delivery (2nd Ed.), Robinson et al. (eds.), pp. 
623-53 (Marcel Dekker, Inc. 1987); Thorpe, "Antibody Carriers Of Cytotoxic Agents In Cancer Therapy: 
A Review", in Monoclonal Antibodies '84: Biological And Clinical Applications, Pinchera et al. (eds.), 
pp 475-506 (1985); "Analysis, Results, And Future Prospective Of The Therapeutic Use Of Radiolabeled 
10 Antibody In Cancer Therapy", in Monoclonal Antibodies For Cancer Detection And Therapy, Baldwin et 
al. (eds.), pp. 303-16 (Academic Press 1985), and Thorpe et al., "The Preparation And Cytotoxic 
Properties Of Antibody-Toxin Conjugates", Immunol. Rev., 62: 1 19-58 (1982). 

Alternatively, an antibody can be conjugated to a second antibody to form an antibody 
heteroconjugate as described in U.S. 4,676,980. 
15 An antibody with or without a therapeutic moiety conjugated to it can be used as a therapeutic 

that is administered alone or in combination with cytotoxic factors) and/or cytokine(s). 

A further aspect of the invention provides methods of screening for active agents that modulate 
(e.g. upregulate or downregulate) a characteristic of, e.g. the expression or the enzymatic or binding 

activity, of an NKCC1 polypeptide. „,',.,» 
20 The invention provides methods for identifying active agents that bind to an NKCC1 polypeptide 

or have a modulatory effect (e.g. stimulatory, inhibitory, up-regulation, downregulation) effect on the 
expression or activity of an NKCC1 polypeptide. Examples of agents, include, but are not limited to, 
nucleic acids (e.g. DNA and RNA), carbohydrates, lipids, proteins, peptides, peptidomimetics, agonists, 
antagonists, small molecules and other drugs. Active agents can be obtained using any of the numerous 
25 suitable approaches in combinatorial library methods known in the art, including: biological libraries; 
spatially addressable parallel solid phase or solution phase libraries; synthetic library methods requiring 
deconvolution; the "one-bead one-compound" library method; and synthetic library methods usmg 
affinity chromatography selection. The biological library approach is limited to peptide libraries, while 
the other four approaches are applicable to peptide, non-peptide oligomer or small molecule libraries of 
30 compounds (Lam, 1997, Anticancer Drug Des. 12: 145; U.S. 5,738,996; and U.S. 5,807,683, each of 
which is incorporated herein in its entirety by reference). 

Examples of methods for the synthesis of molecular libraries can be found in the art, for example 
in- DeWitt et al., 1993, Proc. Natl. Acad. Sci. USA 90:6909; Erb et al., 1994, Proc. Nad. Acad. Sci. USA 
91-11422; Zuckermann et al., 1994, J. Med. Chem. 37:2678; Cho et al., 1993, Science 261:1303; Carrell 
35 et al., 1994, Angew. Chem. Int. Ed. Engl. 33:2059; Carell et al., 1994, Angew. Chem. Int. Ed. Engl. 
33:2061; and Gallop et al., 1994, J. Med. Chem. 37:1233, each of which is incorporated herem in its 
entirety by reference. 

Libraries of agents may be presented, e.g. presented in solution (e.g. Houghten, 1992, 
Bio/Techniques 13:412-421), or on beads (Lam, 1991, Nature 354:82-84), chips (Fodor, 1993, Nature 
40 364:555-556), bacteria (U.S. Patent No. 5,223,409), spores (U.S. Patent Nos. 5,571,698; 5,403,484; and 
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5,223,409), plasmids (Cull et al., 1992, Proc. Natl. Acad. Sci. USA 89:1865-1869) or phage (Scott and 
Smith, 1990, Science 249:386-390; Devlin, 1990, Science 249:404-406; Cwirla et al., 1990, Proc. Nad. 
Acad.Sci. USA 87:6378-6382; and Felici, 1991, J. Mol. Biol. 222:301-310), each of which is 
incorporated herein in its entirety by reference. 

In one embodiment, agents that interact with (i.e., bind to) an NKCC1 polypeptide are identified 
in a cell-based assay system. In accordance with this embodiment, cells expressing an NKCC1 
polypeptide are contacted with a candidate agent or a control agent and the ability of the candidate agent 
to interact with the polypeptide is determined. If desired, this assay may be used to screen a plurality (e.g. 
a library) of candidate agents. The cell, for example, can be of prokaryotic origin (e.g. E. coli) or 
eukaryotic origin (e.g. yeast or mammalian). Further, the cells can express the NKCC1 polypeptide 
endogenous^ be genetically engineered to express said polypeptide. In some embodiments, the ^ 
NKCC1 polypeptide or the candidate agent is labelled, for example with a radioactive label (such as P, 
35 S or 125 T) or a fluorescent label (such as fluorescein isothiocyanate, rhodamine, phycoerythrin, 
phycocyanin, allophycocyanin, o-phthaldehyde or fluorescamine) to enable detection of an interaction 
between a polypeptide and a candidate agent The ability of the candidate agent to interact directiy or 
indirecdy with the NKCC1 polypeptide can be determined by methods known to those of skill in the art. 
For example, the interaction between a candidate agent and a polypeptide can be determined by flow 
cytometry, a scintillation assay, immunoprecipitation or western blot analysis. 

In another embodiment, agents that interact with (i.e., bind to) an NKCC1 polypeptide are 
identified in a cell-free assay system In accordance with this embodiment, a native or recombinant 
NKCC1 polypeptide is contacted with a candidate agent or a control agent and the ability of the candidate 
agent to interact with the polypeptide is determined. If desired, this assay may be used to screen a 
plurality (e.g. a library) of candidate agents. Preferably, the polypeptide is first immobilised, by, for 
example, contacting the polypeptide with an immobilised antibody which specifically recognises and 
binds it, or by contacting a purified preparation of polypeptide with a surface designed to bind proteins. 
The polypeptide may be partially or completely purified (e.g. partially or completely free of other 
polypeptides) or part of a cell lysate. Further, the polypeptide may be a fusion protein comprising the 
polypeptide for use in the invention or a biologically active portion thereof and a domain such as 
glutathionine-S-transferase. Alternatively, the polypeptide can be biotinylated using techniques well 
known to those of skill in the art (e.g. biotinylation kit, Pierce Chemicals; Rockford, IL). The ability of 
the candidate agent to interact with the polypeptide can be can be determined by methods known to those 
of skill in the art. 

In another embodiment, a cell-based assay system is used to identify active agents that bind to or 
modulate the activity of a protein, such as an enzyme, or a biologically active portion thereof, which is 
responsible for the production or degradation of the NKCC1 polypeptide or is responsible for the post- 
translational modification of the polypeptide. In a primary screen, a plurality (e.g. a library) of agents are 
contacted with cells that naturally or recombinantly express: (i) an NKCC1 polypeptide; and (ii) a protein 
that is responsible for processing of the NKCC1 polypeptide in order to identify compounds that 
modulate the production, degradation, or post-translational modification of the polypeptide. If desired, 
active agents identified in the primary screen can then be assayed in a secondary screen against cells 
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naturally or mcombinantly expressing the specific polypeptide of interest The ability of the candidate 
agent to modulate the production, degradation or post-translational modification of a polypeptide can be 
determined by methods known to those of skill in the art, including without limitation, flow cytometry, a 
scintillation assay, immunoprecipitation and western blot analysis. . 

5 In another embodiment, agents that competitively interact with (i.e., bind to) an NKCC1 

polypeptide are identified in a competitive binding assay. In accordance with this embodiment, cells 
expressing the polypeptide are contacted with a candidate agent and an agent known to interact with the 
polypeptide; the ability of the candidate agent to competitively interact with the polypeptide is then 
determined. Alternatively, agents that competitively interact with (i.e., bind to) a polypeptide are 

10 identified in a cell-free assay system by contacting the polypeptide with a candidate agent and an agent 
known to interact with the polypeptide. As stated above, the ability of the candidate agent to interact 
with a polypeptide for use in the invention can be determined by methods known to those of skill in the 
art. These assays, whether cell-based or cell-free, can be used to screen a plurality (e.g. a library) of 
candidate agents. 

15 In another embodiment, active agents that modulate (i.e., upregulate or downregulate) the 

expression of an NKCC1 polypeptide are identified by contacting cells (e.g. cells of prokaryotic origin or 
eukaryotic origin) expressing the polypeptide with a candidate agent or a control agent (e.g. phosphate 
buffered saline (PBS)) and determining the expression of the polypeptide or mRNA encoding the 
polypeptide. The level of expression of a selected polypeptide or mRNA encoding polypeptide in the 

20 presence of the candidate agent is compared to the level of expression of the polypeptide or mRNA 

encoding the polypeptide in the absence of the candidate agent (e.g. in the presence of a control agent). 
The candidate agent can then be identified as a modulator of the expression of the polypeptide based on 
this comparison. For example, when expression of the polypeptide or mRNA encoding the polypeptide is 
significantly greater in the presence of the candidate agent than in its absence, the candidate agent is 

25 identified as a stimulator of expression of the polypeptide or mRNA encoding the polypeptide. 

Alternatively, when expression of the polypeptide or mRNA encoding the polypeptide is significantly less 
in the presence of the candidate agent than in its absence, the candidate agent is identified as an inhibitor 
of the expression of the polypeptide or mRNA encoding the polypeptide. The level of expression of a 
polypeptide for use in the invention, or the mRNA that encodes it, can be determined by methods known 

30 to those of skill in the art based on the present description. For example, mRNA expression can be 
assessed by Northern blot analysis or RT-PCR, and protein levels can be assessed by western blot 
analysis. 

In another embodiment, active agents that modulate the activity of a polypeptide are identified by 
contacting a preparation containing the polypeptide, or cells (e.g. prokaryotic or eukaryotic cells) 

35 expressing the polypeptide with a candidate agent or a control agent and determining the ability of the 
candidate agent to modulate (e.g. stimulate or inhibit) the activity of polypeptide. The activity of a 
polypeptide can be assessed by detecting its effect on a "downstream effector" for example, but without 
limitation, induction of a cellular signal transduction pathway of the polypeptide (e.g. , intracellular Ca 2+ , 
diacylglycerol, IP3, etc.), detecting catalytic or enzymatic activity of the target on a suitable substrate, 

40 detecting the induction of a reporter gene (e.g. a regulatory element that is responsive to a polypeptide for 
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use in the invention and is operably linked to a nucleic acid encoding a detectable marker, e.g. luciferase), 
or detecting a cellular response, for example, cellular differentiation, or cell proliferation as the case may 
be, based on the present description, techniques known to those of skill in the art can be used for 
measuring these activities (see, e.g. U.S. 5,401,639). The candidate agent can then be identified as a 
5 m()dulatortfmeactivityofapolypepti^ 

candidate agent to the control agent. Suitable control agents include phosphate buffered saline (PBS) and 

normal saline (NS). # , 

In another embodiment, active agents that modulate the expression, activity or both the 
expression and activity of an NKCC1 polypeptide are identified in an animal model. Examples of 

10 suitable animals include, but are not limited to, mice, rats, rabbits, monkeys, guinea pigs, dogs and cats. 
Preferably, the animal used represents a model of breast, pancreatic and/or lung cancer. In accordance 
with this embodiment, the candidate agent or a control agent is administered (e.g. orally, rectally or 
parenterally such as intraperitoneally or intravenously) to a suitable animal and the effect on the 
expression, activity or both expression and activity of the polypeptide is determined. Changes m the 

15 expression of a polypeptide can be assessed by any suitable method described above, based on the present 

description. . . „ . 

In yet another embodiment, a polypeptide for use in the invention is used as a baitprotem in a 

two-hybrid assay or three hybrid assay to identify other proteins that bind to or interact with the 
polypeptide (see, e.g. U.S. 5,283,317; Zervos et al. (1993) Cell 72:223-232; Madura et al. (1993) J. Biol. 
20 Chem. 268:12046-12054; Bartel et al. (1993) Bio/Techniques 14:920-924; Iwabuchi et al. (1993) 

Oncogene 8: 1693-1696; and WO 94/10300). As those skilled in the art will appreciate, such binding 
proteins are also likely to be involved in the propagation of signals by the polypeptides for use in the 
invention as, for example, upstream or downstream elements of a signalling pathway involving the 

NKCC1 polypeptides. J 
25 This invention further provides novel active agents identified by the above-described screening 

assays described above and uses thereof for treatments as described herein. 

The invention also provides for treatment or prevention of breast, pancreatic and/or lung cancer 
by administration of a therapeutic compound, Such compounds include but are not limited to: NKCC1 
polypeptides; antibodies thereto; NKCC1 nucleic acid molecules; antisense nucleic acids to a gene 
30 encoding an NKCC1 polypeptide, and agonists and antagonists of a gene encoding an NKCC1 nucleic 

acid or agonists and antagonists of an NKCC1 polypeptide. An important feature of the present invention 
is the identification of a gene encoding a polypeptide as defined herein involved in breast, pancreatic 
and/or lung cancer, breast, pancreatic and/or lung cancer can be treated or prevented by administration of 
a therapeutic compound that modulates activity or expression of a polypeptide as defined herein in the 
35 tissue of breast, pancreatic and/or lung cancer patients. 

In a further aspect, the present invention provides the use of an NKCCI polypeptide m the 
production of a composition for the treatment or prophylaxis of breast, lung and/or pancreatic cancer, 
wherein the composition is a vaccine. The vaccine optionally comprises one or more suitable adjuvants. 
Examples of adjuvants well-known in the art include inorganic gels, such as aluminium hydroxide, and 
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water-in-oil emulsions, such as incomplete Freund's adjuvant. Other useful adjuvants will be well known 
to the skilled person. 

In yet further aspects, the present invention provides: 

(a) the use of a polypeptide as defined herein in the preparation of an immunogenic 
composition, preferably a vaccine; 

(b) the use of such an immunogenic composition in inducing an immune response in a 
subject; and 

(c) a method for the treatment or prophylaxis of breast, lung and/or pancreatic cancer in a 
subject, or of vaccinating a subject against breast, lung and/or pancreatic cancer which comprises 
the step of administering to the subject an effective amount of an NKCC1 polypeptide, preferably 
as a vaccine. 

As used herein "active agent" refers to the NKCC1 polypeptides and NKCC1 nucleic acid 
molecules, antibodies against said polypeptides and agents e.g. small molecules, which modulate the 
expression of said polypeptides. 

As discussed herein, active agents of the invention find use in the treatment or prophylaxis of 
breast, pancreatic and/or lung cancer. Thus, in a further aspect, the present invention provides a 
pharmaceutical composition comprising at least one active agent, optionally together with one or more 
pharmaceutically acceptable excipients, carriers or diluents. In one embodiment, the pharmaceutical 
composition is for use as a vaccine and so any additional components will be acceptable for vaccine use. 
In addition, the skilled person will appreciate that one or more suitable adjuvants may be added to such 
vaccine preparations. 

The composition will usually be supplied as part of a sterile, pharmaceutical composition that will 
normally include a pharmaceutically acceptable carrier. This pharmaceutical composition may be in any 
suitable form (depending upon the desired method of administering it to a patient). 

It may be provided in unit dosage form, will generally be provided in a sealed container and may be 
provided as part of a kit Such a kit would normally (although not necessarily) include instructions for use. It 
may include a plurality of said unit dosage forms. 

The pharmaceutical composition may be adapted for administration by any appropriate route, for 
example by the oral (including buccal or sublingual), rectal, nasal, topical (including buccal, sublingual or 
transdermal), vaginal or parenteral (including subcutaneous, intramuscular, intravenous or intradermal) route. 
Such compositions may be prepared by any method known in the art of pharmacy, for example by admixing 
the active ingredient with the carrier(s) or excipient(s) under sterile conditions. 

Pharmaceutical compositions adapted for oral administration may be presented as discrete units such 
as capsules or tablets; as powders or granules; as solutions, syrups or suspensions (in aqueous or non-aqueous 
liquids; or as edible foams or whips; or as emulsions). 

Suitable excipients for tablets or hard gelatine capsules include lactose, maize starch or derivatives 
thereof, stearic acid or salts thereof. 

Suitable excipients for use with soft gelatine capsules include for example vegetable oils, waxes, fats, 
semi-solid, or liquid polyols etc. 




For the preparation of solutions and syrups, excipients which may be used include for example water, 
polyols and sugars. For the preparation of suspensions, oils (e.g. vegetable oils) may be used to provide oil- 
in-water or water in oil suspensions. 

Pharmaceutical compositions adapted for transdermal administration may be presented as discrete 
5 patches intended to remain in intimate contact with the epidermis of the recipient for a prolonged period of 
time. For example, the active ingredient may be delivered from the patch by iontophoresis as generally 
described in Pharmaceutical Research, 3(6):318 (1986). 

Pharmaceutical compositions adapted for topical administration may be formulated as ointments, 
creams, suspensions, lotions, powders, solutions, pastes, gels, sprays, aerosols or oils. For infections of the 
10 eye or other external tissues, for example mouth and skin, the compositions are preferably applied as a topical 
ointment or cream. When formulated in an ointment, the active ingredient may be employed with either a 
paraffinic or a water-miscible ointment base. Alternatively, the active ingredient may be formulated in a 
cream with an oil-in-water cream base or a water-in-oil base. Pharmaceutical compositions adapted for 
topical administration to the eye include eye drops wherein the active ingredient is dissolved or suspended in 
15 a suitable carrier, especially an aqueous solvent. Pharmaceutical compositions adapted for topical 
administration in the mouth include lozenges, pastilles and mouth washes. 

Pharmaceutical compositions adapted for rectal administration may be presented as suppositories or 

enemas. 

Pharmaceutical compositions adapted for nasal administration wherein the carrier is a solid include a 
10 coarse powder having a particle size for example in the range 20 to 500 microns which is administered in the 
manner in which snuff is taken, i.e. by rapid inhalation through the nasal passage from a container of the 
powder held close up to the nose. Suitable compositions wherein the carrier is a liquid, for administration as 
a nasal spray or as nasal drops, include aqueous or oil solutions of the active ingredient. 

Pharmaceutical compositions adapted for administration by inhalation include fine particle dusts or 
25 mists which may be generated by means of various types of metered dose pressurised aerosols, nebulisers or 
insufflators. 

Pharmaceutical compositions adapted for vaginal administration may be presented as pessaries, 
tampons, creams, gels, pastes, foams or spray compositions. 

Pharmaceutical compositions adapted for parenteral administration include aqueous and non-aqueous 
30 sterile injection solution which may contain antioxidants, buffers, bacteriostats and solutes which render the 
composition substantially isotonic with the blood of the intended recipient; and aqueous and non-aqueous 
sterile suspensions which may include suspending agents and thickening agents. Excipients which may be 
used for injectable solutions include water, alcohols, polyols, glycerine and vegetable oils, for example. The 
compositions may be presented in unit-dose or multi-dose containers, for example sealed ampoules and vials, 
35 and may be stored in a freeze-dried (lyophilised) condition requiring only the addition of the sterile liquid 
carried, for example water for injections, immediately prior to use. Extemporaneous injection solutions and 
suspensions may be prepared from sterile powders, granules and tablets. 

The pharmaceutical compositions may contain preserving agents, solubilising agents, stabilising 
agents, wetting agents, emulsifiers, sweeteners, colourants, odourants, salts (polypeptides for use in the 
40 present invention may themselves be provided in the form of a pharmaceutically acceptable salt), buffers, 
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coating agents or antioxidants. They may also contain therapeutically active agents in addition to the 
polypeptide for use in the present invention. 

Dosages of the polypeptide, nucleic acid or antibody for use in the present invention can vary 
between wide limits, depending upon the stage of the breast, lung and/or pancreatic cancer, the age and 
5 condition of the individual to be treated, etc. and a physician will ultimately determine appropriate dosages to 
be used. This dosage may be repeated as often as appropriate. If side effects develop the amount and/or 
frequency of the dosage can be reduced, in accordance with normal clinical practice. 

In view of the importance of NKCC1 in breast, pancreatic and/or lung cancer the following form 
additional aspects of the present invention: 
10 i) a method for monitoring/assessing breast, pancreatic and/or lung cancer treatment in a 

patient, which comprises the step of determining the presence or absence and/or quantifying at 
least one NKCC1 polypeptide, at least one NKCC1 nucleic acid molecule or at least one 
antibody for use in the invention in a biological sample obtained from said patient; 
ii) methods of treating breast, pancreatic and/or lung cancer, comprising administering to a 
15 patient a therapeutically effective amount of a compound that modulates (e.g. upregulates.or 

downregulates) or complements the expression or the biological activity (or both) of an 
NKCC1 polypeptide in patients having breast, pancreatic and/or lung cancer, in order to (a) 
prevent the onset or development of breast, lung and/or pancreatic cancer, (b) prevent the 
progression of breast, pancreatic and/or lung cancer; or (c) ameliorate the symptoms of breast, 
20 pancreatic and/or lung cancer. 

In the context of the present invention, the biological sample can be obtained from any source, 
such as a serum sample or a tissue sample, e.g. breast, lung, or pancreatic tissue. 

The invention also provides diagnostic kits, comprising an NKCC1 antibody. In addition, such a 
kit may optionally comprise one or more of the following: 
25 (1) instructions for using the antibody for diagnosis, prognosis, therapeutic monitoring or any 

combination of these applications; 

(2) a labelled binding partner to the antibody; 

(3) a solid phase (such as a reagent strip) upon which the antibody is immobilised; and 

(4) a label or insert indicating regulatory approval for diagnostic, prognostic or therapeutic use or 
30 any combination thereof. 

If no labelled binding partner to the antibody is provided, the anti-NKCCl antibody itself can be 
labelled with a detectable marker, e.g. a chemiluminescent, enzymatic, fluorescent, or radioactive moiety. 

The invention also provides a kit comprising a nucleic acid probe capable of hybridising to RNA 
encoding an NKCC1 polypeptide. In a specific embodiment, a kit comprises in one or more containers a 
35 pair of primers (e.g. each in the size range of 6-30 nucleotides, more preferably 10-30 nucleotides and still 
more preferably 10-20 nucleotides) that under appropriate reaction conditions can prime amplification of 
at least a portion of a nucleic acid encoding a polypeptide as defined herein, such as by polymerase chain 
reaction (see e.g. Innis et al., 1990, PCR Protocols, Academic Press, Inc., San Diego, CA), ligase chain 
reaction (see EP 320,308) use of Qj3 replicase, cyclic probe reaction, or other methods known in the art. 




The invention provides methods for identifying active agents or candidate agents that bind to a 
polypeptide as defined herein or have a stimulatory or inhibitory effect on the expression or activity of a 
polypeptide as defined herein. 

The present invention also provides assays for use in drug discovery in order to identify or verify 

5 the efficacy of agents for treatment or prevention of breast, lung and/or pancreatic cancer. Candidate 
agents can be assayed for their ability to modulate levels of a polypeptide as defined herein in a subject 
having breast, lung and/or pancreatic cancer. Agents able to modulate levels of a polypeptide as defined 
herein in a subject having breast, lung and/or pancreatic cancer towards levels found in subjects free from 
breast! lung and/or pancreatic cancer or to produce similar changes in experimental animal models of 

10 breast, lung and/or pancreatic cancer can be used as lead agents for further drug discovery, or used 

therapeutically. Expression of an NKCC1 polypeptide can be assayed by, for example, immunoassays, 
gel electrophoresis followed by visualisation, detection of activity, or any other method taught herein or 
known to those skilled in the art. Such assays can be used to screen candidate agents, in clinical 
monitoring or in drug development, where abundance of a polypeptide as defined herein can serve as a 

15 surrogate marker for clinical disease. 

This invention further provides novel active agents identified by the above-described screening 
assays and uses thereof for treatments as described herein. In addition, the invention also provides the use 
of an active agent, which interacts with, or modulates the activity of an NKCC1 polypeptide in the 
manufacture of a composition for the treatment of breast, lung and/or pancreatic cancer. 

20 In another aspect, the present invention provides a method for the prophylaxis and/or treatment of 

breast, lung and/or pancreatic cancer in a subject, which comprises administering to said subject a 
therapeutically effective amount of an antibody which binds to at least one NKCC1 polypeptide. 

In yet another aspect, the present invention provides the use of an antibody which binds to at least 
one NKCC1 polypeptide in the preparation of a composition for use in the prophylaxis and/or treatment 

25 of breast, lung and/or pancreatic cancer. In particular, the preparation of compositions comprising or 
consisting of antibodies is another embodiment of this aspect of the invention. 

In one embodiment, one or more active agents are administered alone or in combination with one 
or more additional treatments or therapeutic compounds for breast, lung and/or pancreatic cancer. 
Examples of such treatments include, surgery and radiation therapy. Examples of therapeutic compounds 

30 include but are not limited to cyclophosphamide (Cytoxan™); methotrexate (Methotrexate™); 5- 

fluorouracil (5-FU); paclitaxel (Taxol); docetaxel (Taxotere™); vincristine (Oncovin™); vinblastine 
(Velban™); vinorelbine (Navelbine™); doxorubicin (Adriamycin); tamoxifen (Nolvadex™); toremifene 
(Fareston™); megestrol acetate (Megace™); anastrozole (Arimidex™); goserelin (Zoladex™); anti-HER2 . 
monoclonal antibody (Herceptin™); capecitabine (Xeloda™) and raloxifene hydrochloride (Evista™). 

35 The agents of the invention include but are not limited to any agent, e.g. a small organic 

molecule, protein, peptide, antibody, nucleic acid, etc. that restores the profile towards normal. 

In another embodiment, symptoms of breast, pancreatic and/or lung cancer may be ameliorated 
by decreasing the level or activity of an NKCC1 polypeptide by using gene sequences encoding an 
NKCC1 polypeptide in conjunction with well-known gene "knock-out," ribozyme or triple helix methods 

40 to decrease gene expression of the polypeptide. In this approach, ribozyme or triple helix molecules are 
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used to modulate the activity, expression or synthesis of the gene, and thus to ameliorate the symptoms of 
breast, pancreatic and/or lung cancer. Such molecules may be designed to reduce or infcbit expression of 
a mutant or non-mutant target gene. Techniques for the production and use of such molecules are well 

known to those of skill in the art- . 
5 Endogenous polypeptide expression can also be reduced by inactivatmg or knoclang out the 

~ geneencodingthepolypeptide,o^^ 

(e.g. see Smithies, et «/., 1985, Nature 317:230-234; Thomas & Capeccm, 1987 CeU 51:503-512, 

ThUson^,1989,Ce^^ 

mutant gene encoding a non-functional polypeptide (or a completely unrelated DNA sequence) flanked by 
10 DNA homologous to the endogenous gene (either the coding regions or regulatory regions of the gene 
' encoding the polypeptide) can be used, with or without a selectable marker and/or a negatrve se Stable 
marker, to transit cells that express the target gene in vivo. Insertion of the DNA construct, via targeted 
homologous recombination, results in inactivation of the target gene. This approach can be adapted for 
use in humans, provided the recombinant DNA constructs are directly administered or targeted to the 
15 required site in vivo using appropriate viral vectors. 

Preferred features of each aspect of the invention are as for each of the other aspects^ 
mutandis. The prior art documents mentioned herein are incorporated to the fullest extent permitted by 

The invention will now be described with reference to the following examples, which should not 
20 in any way be construed as limiting the scope of the present invention. The examples refer to the figures 
in which: 

Figure Is shows the nucleotide and amino acid sequences of NKCC1 (GenBank accession: 
U30246; SwissProt accession: P5501 1). The tandem spectra used to identify NKCC1 in breast 
25 and pancreatic cancer cell line membrane preparations are shown in bold, italicised, and 

underlined. Masses assigned to NKCC1 are shown in bold and italicised (see below). 

Figure 2: shows the normal tissue distribution of NKCC1 mRNA. levels of mRNA in normal 
tissues were quantified by real time RT-PCR. mRNA levels are expressed as the number of 
30 copies ng 1 cDNA. 

Figure 3: shows the expression of NKCC1 mRNA in normal and breast cancer tissues levels of 
NKCC1 mRNA in donor matched normal and adjacent tumour tissues were measured by real 
time RT-PCR. mRNA levels are expressed as the number of copies ng cDNA. 



35 



Figure 4: shows the expression of NKCC1 mRNA in normal pancreatic tissues, pancreatic cancer 
tissues, and normal and tumour-derived pancreatic cell lines. Levels of NKCC1 mRNA were 
measured by real time RT-PCR. mRNA levels are expressed as the number of copies ng cDN A. 
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Figures: shows immunohistochemical analysis of NKCCl protein expression in sections of 
breast ductal carcinoma tissues. Invasive ductal carcinoma tissue sections from two separate 
donors is shown (A and B); the top panel shows NKCCl immunostaining, the bottom panel 
shows an adjacent hematoxylin and eosin stained section. Note the NKCCl immunoreactmty on 
the plasma membrane of the carcinoma cells. 

Figure 6: shows immunohistochemical analysis of NKCCl protein expression in sections of lung 
adenocarcinoma tissue. The top panel (A) shows NKCCl immunostaining, the bottom panel 
shows an adjacent control section. 

Figure 7: shows immunohistochemical analysis of NKCCl protein expression in sections of 
pancreatic adenocarcinoma tissue. The top panel (A) shows NKCCl immunostaining, the bottom 
panel shows an adjacent control section. 

5 

Examj aleJiIdentifi catiog nf NKCCl 

NKCCl was isolated from breast cancer (MDA MB 486/BT474) and pancreatic (HPAFH) cell lines. 

Cells were grown to confluence in 15cm 2 cell culture dishes before fractionation. Before harvest 
.0 and extraction, the cells (approx. 2x10* cells) were washed three times with PBS-CM. Cells were scraped 
from culture dishes in ice-cold PBS-CM (5ml per 2xl0 8 cells) using a plastic cell lifter. The cells were, 
then centrifuged at 1000 x g for 5 min at +4°C. The supernatant was removed and the cells were 
resuspended in lOmls of homogenisation buffer (250mM Sucrose in lOmM HEPES, ImMEDTA ImM 
Vanadate, 0.02% Azide) followed by centrifugation at 1000 x g for 5 min at +4-C. The supernatant was 
25 removed. The cell pellet was then resuspended in 5 x packed cell volume with homogenisation buffer 
plus protease inhibitors (Sigma). A ball bearing homogeniser (BBH) (8.002mm ball) was chilled and 
rinsed with homogenisation buffer. The cell suspension was taken up in a 2ml syringe and attached to 
one side of the BBH. Another syringe was attached to the other side of the BBH. The cell mixture was 
fed through the chamber up to five times. Breakage of the cells was monitored using a microscope and 
30 when the cells were sufficiently lysed the resulting mixture was centrifuged at 1000 x g for 5 mm at 

+4 C The resulting supernatant (PNS) was retained and 1ml of homogenisation buffer was added to the 
nuclear pellet followed by centrifugation at lOOOx g for 5 min. The latter two fractions were pooled and 
centrifuged at 3000 x g for 10 min at +4°C. The 3000 x g supernatant was layered onto a 2ml 60% 
35 sucrose cushion in SW40 or SW60 tube and centrifuged at 100 000 x g for 45 min with slow acceleration 
and deceleration. The crude plasma membrane was evident as a discrete layer on top of the sucrose 
cushion. The upper layer was removed (cytosol) and the plasma membrane was collected using a pasteur 
pipette. The % sucrose of crude plasma membrane fraction was determined using a refractometer. The 
membrane preparation was diluted with HEPES buffer to reduce the sucrose content to below 15%. The 
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crude plasma membrane preparation was layered on preformed 15 to 60% sucrose gradient in SW40 tube 
and spun at 100 000 x g for 17 h with slow acceleration and deceleration. 

The sucrose gradient was fractionated using the gradient unloader (speed 0.5, distance 2.5, 
fractions 35). The protein content of the fractions was measured and 10 micrograms of protein was run 
on a 4-20% acrylamide ID gel (Novex) and subject to western blotting with antibodies to Transferrin 
Receptor, Oxidoreductase II and Calnexin. 

Plasma membrane fractions that had transferrin immunoreactivity but no oxidoreductase II or 
calnexin immunoreactivity were identified. These sucrose fractions were pooled and diluted at least four 
times with lOmM HEPES, ImM EDTA ImM Vanadate, 0.02% Azide. The diluted sucrose fraction was 
added to a SW40 or SW60 tube and centrifuged at 100 000 x g for 45 min with slow acceleration and 
deceleration. The supernatant was removed from the membrane pellet and the pellet washed three times 
with PBS-CM. The membrane pellet was solubulized in 2% SDS in 63mM TrisHCl, pH 7.4. A protein 
assay was performed followed by the addition of mercaptoethanol (2% final), glycerol (10%) and 
bromopheneol blue (0.0025% final). The extracted protein sample was finally solubilized in ID lysis 
buffer to a final protein concentration of 1 microgram/microlitre and the proteins separated by ID PAGE. 

Mass Spectrometry 

Proteins excised from the ID gel were digested with trypsin and analysed by MALDI-TOF-MS 
(Voyager STR, Applied Biosystems) using a 337 nm wavelength laser for desorption and the reflectron 
mode of analysis. Selected masses for NKCC1 were further characterised by tandem mass spectrometry 
using a QTOF-MS equipped with a nanospray ion source, (Micrornass UK Ltd.). Prior to MALDI 
analysis the samples were desalted and concentrated using CI 8 Zip Tips™ (Millipore). Samples for 
tandem mass spectrometry (MS) were purified using a nano LC system (LC Packings, Amsterdam, The 
Netherlands) incorporating CI 8 SPE material. 

Using the SEQUEST search program (Eng et al., 1994, J. Am. Soc. Mass Spectrom. 5:976-989), 
uninterpreted tandem mass spectra of tryptic digest peptides were searched against a database of public 
domain proteins constructed of protein entries in the non-redundant database held by the National Centre 
for Biotechnology Information (NCBI). This database is accessible at http://www.ncbi.nlm.nih.gov/ and 
also constructed of Expressed Sequence Tags entries (http://www.ncbi.nlm.nih.gov/dbEST/index.html). 
As a result of sequence database searching tandem amino acid sequences were found to match a 
SwissProt accession number: P55011 (Bumetanide-sensitive sodium-(potassium)-chloride cotransporter 
1), see Figure 1, matched tandem sequences are shown in bold. 

Alternatively sequences were identified using peptide mass data derived from mass spectrometer 
analysis and the MOWSE database search procedure. Peptide mass information can provide a 'fingerprint 1 
signature sufficiently discriminating to allow for the unique and rapid identification of unknown sample 
proteins, independent of other analytical methods such as protein sequence analysis. Practical experience 
has shown that sample proteins can be uniquely identified using as few as 3-4 experimentally determined 
peptide masses when screened against a fragment database derived from over 50,000 proteins (D. J.C. 
Pappin, P. Hojrup and AJ. Bleasby *Rapid Identification of Proteins by Peptide-Mass Fingerprinting 1 . 
Current Biology (1993), vol 3, 327-332. and 
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http://www.hgrap.nirc.ac.ukaioinfonnatics/Webapp/mowseO. The version of the code used for a 
MOWSE database search had the following modifications: the size of the parent protein is not included in 
the calculation such that large proteins such as titin no longer bias the score; instead the theoretical 
frequency of a peptide of mass (x) is estimated using the mass (x) of the peptide and the mean mass of an 
amino acid whilst allowing for a probability of 0.2 for a missed internal cleavage (by trypsin) and 0.1 for 
the probability of the occurrence of a proteolytic cleavage site. 

The score assigned to a match on a peptide mass (x) is the logarithm of the probability of rinding 
such a match at random and is inversely proportional to the frequency of fragments of that mass. 

The score is thus calculated using the linear regression formula found by plotting the score for a 
match on peptide mass (x) against the mass (x). Two mass matches to predicted trypsin fragments were 
identified in this manner for NKCC1, which also matched to P5501 1, see Figure 1, the sequences which 
were identified through mass matching are shown in italics and underlined. The accuracy of this method 
was 20 ppm. 

Example 2i Exnression of NKCf.1 mRNA i n human tissues 

Real time quantitative RT-PCRwas used (Heid, C.A., Stevens, J., Livak, K.J. & Williams, P.M. 
Real time quantitative PCR. Getwme Res. 6, 986-994 (1996); Morrison, T.B., Weis, J J. & Wittwer, C.T. 
Quantification of low-copy transcripts by continuous SYBR Green I monitoring during amplification. 
Bioteckniques 24, 954-958 (1998)) to analyse the distribution of NKCC1 mRNA in normal human tissues 
20 (Fig 2), donor matched tumour and adjacent normal tissues from seven breast cancer patients, (Fig 3), 
normal and tumour-derived pancreatic cell lines and tissue specimens (Fig 4). 

Quantification of NKCC1 mRNA by RT-PCR 

Real-time RT-PCR was used to quantitatively measure NKCC1 expression in normal human tissue 
25 mRNAs (Clontech), donor matched tumour and adjacent normal tissues from breast cancer patients, 

normal and tumour-derived pancreatic cell lines and tissue specimens. Ethical approval for the normal 

and tumour tissue samples was obtained at surgery (University of Oxford, UK). The primers used for 

PCR were as follows: 

sense, 5' cacctactacctgcgcaccttc 3* , (SEQ ID NO: 3) 
30 antisense, 5' gaccacagcatctctggttgga 3'. (SEQ ID NO: 4) 

Reactions contained 5ng cDNA (prepared using Superscript first strand synthesis for RT-PCR kit 
(Life Technologies)), SYBR green sequence detection reagents (PE Biosystems), sense and antisense 
primers, and were assayed on an ABI7700 sequence detection system (PE Biosystems). The PCR 

35 conditions were 1 cycle at 50°C for 2 min, 1 cycle at 95°C for 10 min, and 40 cycles of 95°C for 15sec, 
65°C for 1 min. The accumulation of PCR product was measured in real time as the increase in SYBR 
green fluorescence, and the data were analysed using the Sequence Detector program vl.6.3 (PE 
Biosystems). Standard curves relating initial template copy number to fluorescence and amplification 
cycle were generated using the amplified PCR product as a template, and were used to calculate NKCC1 

40 copy number in each sample. 
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Overall the distribution of NKCC1 mRNA was low in normal tissues, with the highest levels of 
mRNA expression seen in mammary, prostate, testis and brain tissues (Figure 1). 

The expression of NKCC1 mRNA in clinical breast carcinoma tissues was compared with the 
matched adjacent normal tissue from 7 breast cancer patients (Figure 3). NKCC1 expression was 
increased in all of the tumour samples, relative to their matched control tissue, with six of the seven 
samples showing a greater than 6-fold increase in expression. 

NKCC1 mRNA expression was also analysed in normal and tumour pancreatic tissues and cell 
lines (Figure 4). NKCC1 expression was found to be significantly higher in the pancreatic tumour 
tissue(mean = 69 copies ng' 1 cDNA) and tumour-derived cell lines (mean = 1013 copies ng' 1 cDNA) than 
in the corresponding normal pancreatic tissues and cell lines (mean = 16 and 5 copies per ng" 1 cDNA 
respectively). 

Example 3; Immunohistochemical Analysis of NKCC1 in Br east. Lung and Pancreatic Cancer 
Tissues 

To further illustrate the involvement of NKCC1 in breast cancer, immunohistochemistry with a 
specific anti-NKCCl antibody was used to investigate NKCC1 protein expression in sections of multiple 
donor breast ductal carcinoma tissues and in lung adenocarcinoma and pancreatic cancer tissues. 

Antibody generation. 

The anti-NKCCl polyclonal antibody was raised in guinea pigs immunised with 2 specific peptides 
(Immune Systems Limited, UK). Peptide sequences were chosen for synthesis based on plots of 
hydrophobicity, antigenicity, surface probability, and weak homology to other known protein family 
members. Peptides were synthesised using Fmoc chemistry with a cysteine residue added to the end of 
each to enable specific thiol-reactive coupling of Keyhole Limpet Haemocyanin prior to immunisation. 
The peptides used were; SKKPKGFFGYKC (SEQ DO NO: 5) and SGESEPAKGSEEAKGC (SEQ ID 
NO: 6). The antibodies were affinity purified using columns of the above immobilised peptides. 

NKCC1 Immunohistochemistry in Breast Carcinoma tissue 

Immunohistochemical analysis was carried out on formalin-fixed paraffin-embedded tissue microarrays 
containing 1mm sections of breast carcinoma tissue from 55 donors as well as 20 sections of various 
normal tissues (Clinomics Laboratories Inc., 165 Tor Court, Pittsfield, MA 01201). Slides were 
deparafinised by two 5 min washes in xylene then rehydrated through successive graded ethanol solutions 
and washed for 5 mins in PBS. Antigen retrieval was achieved by immersing the slides in 0.01M citrate 
buffer (pH 6) and microwaving for 10 mins at full power (950W). In addition, detection with the 
antibody was improved by protease treatment of the tissue with Autozyme (AbCam) for 10 min at room 
temperature. The tissue was blocked in 10% donkey serum/PBS for 1 hour before addition of 1.5 ug/ml 
primary polyclonal antibody (in 2.5% donkey serum/PBS). Following 3 washes in PBS the tissue 
sections were incubated with biotin-conjugated secondary antibodies (Biotin-SP-conjugated AffimPure 
Donkey anti-guinea pig, Jackson ImmunoResearch) diluted at 1:200 (2.5 ug/ml in 2.5% donkey 
serum/PBS) for 1 hour. Slides were washed 3 times in PBS and the tissue incubated with Streptavidin- 
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HRP (Jackson ImmunoResearch) diluted 1:100 (5 ug/ml in 2.5% donkey serum/PBS), followed by three 5 
min washes in PBS. Antibody signal was detected using DAB substrate solution (Dako Ltd.) according 
to the manufacturers' instructions. An adjacent tissue array was counterstained for hematoxylin and eosm 
(Dako Ltd.) and images were captured by a digital camera attached to a light microscope. 

In the paired images shown in Figure 5 A and Figure 5B the upper panel demonstrates NKCCl 
protein immunostaining in the breast cancer tissue, the lower panel .shows a hematoxylin and eosm 
stained adjacent tissue section demonstrating the tissue histology (carcinoma cells have large nuclei) 

It is clearly apparent from these Figures that NKCCl is specifically and highly expressed in the 
ductal carcinoma cells of the breast cancer tissue (compare with adjacent breast tissue). In total 55 breast 
cancer donor tissues were examined for NKCCl immunoreactivity. Of these 7 demonstrated very high 
NKCCl staining, 30 exhibited high/moderate staining, 13 exhibited weak staining, and 5 showed no 
staining in the carcinoma cells. 

NKCCl Immunohistochftmistrv in Lu np and Pancreatic Cancer tissues 

Immunohistochemical analysis was carried out on formalin-fixed paraffin embedded tissue microarrays 
containing 1 mm sections of carcinoma tissues from a wide range of donors (obtained from Clinomics 
Laboratories Inc., 165 Tor Court, Pittsfield, MA 01201). Slides were deparafinized by two five min 
washes in xylene then rehydrated through successive graded ethanol solutions and washed for 5 mm in 
PBS. Antigen retrieval was achieved by immersing the slides in 0.01M citrate buffer (pH 6) and 
microwaving for 10 min at full power (950W) then treating the tissue with pepsin (1 mg/ml) for 1 mm at 
room temperature at pH 2. The tissue was blocked in 10% donkey serum/PBS for 1 hour before addition 
of 1 ug/ml primary anti-NKCCl polyclonal antibody (in 2.5% donkey serum) for an hour. Following 
three washes in PBS the tissue sections were incubated with biotin-conjugated secondary antibodies 
(Biotin-SP-conjugated AffiniPure Donkey anti-guinea pig IgG, Jackson ImmunoResearch) diluted at 
1-200 (2 5 ug/ml in 2.5% donkey serum/PBS) for 1 hour. Slides were washed 3 times in PBS and the 
tissue incubated with Streptavidin-HRP (Jackson ImmunoResearch) diluted 1:100 (5 ug/ml in 2.5% 
donkey serum/PBS) for an hour, followed by three 5 min washes in PBS. Antibody signal was detected 
using DAB substrate solution (Dako Ltd.) according to the manufacturer' s instructions. 

In the paired images (A + B) shown in Figure 6 the upper panel (A) demonstrates NKCCl protein 
immunostaining in the lung cancer tissue, the lower panel (B) .shows a control adjacent tissue section. 

In the paired images (A + B) shown in Figure 7 the upper panel (A) demonstrates NKCCl protein 
immunostaining in the pancreatic cancer tissue, the lower panel (B) shows a control adjacent tissue 
section. 

Thus NKCCl shows a restricted pattern of expression in normal human tissues, and is elevated 
in breast cancer tissues, lung cancer tissues, pancreatic cancer tissues and cancer cell lines, suggesting that 
this protein has potential as a therapeutic target for breast, lung and/or pancreatic cancer. 
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The. present invention is not to be limited in terms of the particular embodiments described in this 
application, which are intended as single illustrations of individual aspects of the invention. Functionally 
equivalent methods and apparatus within the scope of the invention, in addition to those enumerated 
herein, will be apparent to those skilled in the art from the foregoing description and accompanying 
drawings. Such modifications and variations are intended to fall within the scope of the appended claims. 

When a reference is made herein to a method of treating or preventing a disease or condition 
using a particular agent or combination of agents, it is to be understood that such a reference is intended 
to include the use of that agent or combination of agents in the preparation of a medicament for the 
treatment or prevention of the disease or condition. 

Preferred features of each aspect of the invention are as for each of the other aspects mutatis 
mutandis. 

The contents of each reference, patent and patent application cited in this application is hereby 
incorporated by reference in its entirety. 
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CLAIMS 

1 A method of screening for and/or diagnosis of breast, lung and/or pancreatic cancer in a subject 
and/or monitoring the effectiveness of breast, lung and/or pancreatic cancer therapy, which method 
comprises the step of detecting and/or quantifying in a biological sample obtained from said subject 
(i) an NKCC1 polypeptide which: 

a) comprises or consists of the amino acid sequence shown in Figure 1 (SEQ ID 

NO:l); 

b) is a derivative having one or more amino acid substitutions, modifications, 
deletions or insertions relative to the amino acid sequence shown in Figure 1 
(SEQ ID NO:l) which retains the biological activity of NKCC1; or 

c) is a fragment of a polypeptide having the sequence shown in Figure 1 (SEQ.ID 
NO: 1), which is at least ten amino acids long and has at least 70% sequence 
identity over the length of the fragment; and/or 

(ii) a nucleic acid molecule which: 

d) comprises or consists of the DNA sequence shown in Figure 1 (SEQ ID NO:2) or 

its RNA equivalent; 

e) a sequence which codes for a derivative or fragment of an amino acid molecule as 
defined in (a) or (b) (SEQ ID NO:2). 

f) a sequence which is complementary to the sequences of d) or e); 

g ) a sequence which codes for the same polypeptide, as the sequences of d) or e); or 

h) a sequence which shows substantial identity with any of those of d),e),f) org). 

i) is a fragment of d), e), f), g) or h), which is at least 8 nucleotides in length. 

2. An antibody, functionally-active fragment, derivative or analogue thereof, that specifically binds to a 
polypeptide as defined in claim l(i). 

3. The method according to claim 1, wherein the polypeptide is detected and/or quantified using an 
antibody that specifically binds to a polypeptide as defined in claim l(i). 

4 An antibody according to claim 2 or the method of claim 3; wherein the antibody is monoclonal, 
polyclonal, chimeric, bispecific, humanised or is conjugated to a therapeutic moiety, a second antibody or a 
fragment thereof, a cytotoxic agent or a cytokine. 



5 A method for the prophylaxis and/or treatment of breast, lung and/or pancreatic cancer 
subject, which comprises administering to said subject a therapeutically effective amount of: 
0 a polypeptide as defined in claim l(i), 

ii) a nucleic acid molecule as defined in claim l(ii), or 

iii) a antibody as defined in claim 2 or 4* 



in a 
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6. The use of: 

i) a polypeptide as defined in claim l(i), 

ii) a nucleic acid molecule as defined in claim l(ii), or 

iii) a antibody as defined in claim 2 or 4. 

5 in the preparation of a composition for use in the prophylaxis and/or treatment of breast, lung 

and/or pancreatic cancer. 

7. i) a polypeptide as defined in claim l(i), 

ii) a nucleic acid molecule as defined in claim l(ii), or 
10 iii) a antibody as defined in claim 2 or 4, 

for use in the prophylaxis and/or treatment of breast, lung and/or pancreatic cancer. 

8. The use as claimed in claim 6i); wherein the composition is a vaccine. 

15 9. The method of claim 5 or the use of claim 6 or 7, where the nucleic acid inhibits the expression of 
the polypeptide as defined in claim l(i). 

10. A method of screening for agents that interact with a polypeptide as defined in claim l(i), said 
method comprising: 

20 (a) contacting said polypeptide with a candidate agent; and 

(b) determining whether or not the candidate agent interacts with said polypeptide. 

1 1 . The method according to claim 10, wherein the determination of interaction between the candidate 
agent and the polypeptide comprises quantitatively detecting binding of the candidate agent and said 

25 polypeptide. 



12. A method of screening for agents that modulate 

i) the expression of a polypeptide as defined in claim l(i), 

ii) the expression of a nucleic acid molecule as defined in claim l(ii), or 
30 iii) an antibody as defined in claim 2, 

said method comprising: 

a) comparing the expression or activity of said polypeptide, the expression of said nucleic 
acid molecule or the expression of said antibody, in the presence of a candidate agent with the 
expression or activity of said polypeptide, the expression of said nucleic acid molecule or the 

35 expression of said antibody, in the absence of the candidate agent or in the presence of a control 

agent; and 

b) determining whether the candidate agent causes the expression or activity of said 
polypeptide, the expression of said nucleic acid molecule or the expression of said antibody, to 
change. 



40 
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13. The method of claim 12 wherein the expression or activity level of said polypeptide, the 
expression level of said nucleic acid molecule or the expression of said antibody is compared with a 
predetermined reference range. 

5 14. The method of claims 12 or 13 wherein step b) additionally comprises selecting an agent which 
modulates the expression or activity of said polypeptide, the expression of said nucleic acid molecule or 
the expression of said antibody, for further testing or therapeutic or prophylactic use as an anti- breast, 
lung and/or pancreatic cancer agent 

10 15. An active agent identified by the method of any one of claims 10 to 14 which causes the 

expression or activity of said polypeptide, the expression of said nucleic acid molecule or the expression 
of said antibody, to change. 

16. The use of an active agent which modulates the expression or activity of a polypeptide as defined 
15 in claim l(i) or the expression of a nucleic acid as defined in claim l(ii) in the manufacture of a 

composition for the prophylaxis and/or treatment of breast, lung and/or pancreatic cancer. 

17. An active agent which modulates the expression or activity of a polypeptide as defined in claim 
l(i) or the expression of a nucleic acid as defined in claim l(ii) for use in the prophylaxis and/or treatment 

20 of breast, lung and/or pancreatic cancer. 

18. A method of prophylaxis and/or treatment of breast, lung and/or pancreatic cancer, which 
comprises administering to said subject a therapeutically effective amount of an active agent which 
modulates the expression or activity of a polypeptide as defined in claim l(i) or the expression of a 

25 nucleic acid as defined in claim l(ii). 
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PROTEIN INVOLVED IN CANCER 
Abstract 



The present invention relates to the use of a protein (NKCC1) in the diagnosis, screening, 
treatment and prophylaxis of breast, lung and/or pancreatic cancer, compositions comprising the protein, 
including vaccines, and antibodies that are immunospecific for the protein. 
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Figure 1 

ggtggcctctgtggccgtccaggctagcggcggcccgcaggcggcggggagaaagactct 60 
ctcacctggtcttgcggctgtggccaccgccggccaggggtgtggagggcgtgctgccgg 120 
agacgtccgccgggctctgcagttccgccgggggtcgggcagctatggagccgcggccca 180 

cggcgccctcctccggcgccccgggactggccggggtcggggagacgccgtcagccgctg 240 
APSSGAPGLAGVGETPSAAA 

cgctggccgcagccagggtggaactgcccggcacggctgtgccctcggtgccggagg^ 300 
L AAARVEIiPGTAVPSVPEUA 

ctgcgcccgcgagccgggacggcggcggggtc^ 
acgggctgggcagacccttggggcccaccccgagccaga^ 

tttccgagaacgccgggcgggccgctgctgcggcggcggcggcggcggcggcagcggcgg 
S ENAGRAAAAA.AAAAAAAAA 

cggctggtgctggggcgggggccaagcagacccccgcggacggggaagccagcggcgaga 
AGAGAGAKQTPADGKASGES 

gcgagccagctaaaggcagcgaggaagccaagggccgcttocgcgtgaacttcgtggacc 
EPAKGSEEAKGRFRVNFVDF 

cagctgcctcctcgtcggctgaagacagcctgtcagatgctgccggggtcggagtcgacg 660 
AASSSAEDSLSDA AGVGVDG 

ggcccaacgtgagcttccagaacggcggggacacggtgctgagcgagggcagcagcctgc 720 
P NVSFQNGGDTVLSEGSSLH 

actccggcggcggcggcggcagtgggcaccaccagcactactattatgatacccacacca 780 
SGGGGGSGHHQHYYYD l ni 

acacctactacctgcgcaccttcggccacaacaccatggacgctgtgcccaggatcgatc 840 
TYYLRTFGHNTMDAVPRIDH 

actaccggcacacagccgcgcagctgggcgagaagctgctccggcctagcctggcggagc 900 
YRHTAAQI»GEKI»LRPSLAEL 

tccacgacgagctggaaaaggaaccttttgaggatggctttgcaaatggggaagaaagta 960 
HDELEKEP FEDGFANGEES l 

ctccaaccagagatgctgtggtcacgtatactgcagaaagtaaaggagtcgtgaagtttg 1020 
PTRDAVVTYTAESKGVVKFG 

gctggatcaagggtgtattagtacgttgtatgttaaacatttggggtgtgatgcttttca X080 
WIKGVLVRCMLNIWGVMIiU 4. 

ttagattgtcatggattgtgggtcaagctggaataggtctatcagtccttgtaataatga 1140 
RLSW1VGQAGIGLSVLVIMM 

tggccactgttgtgacaactatcacaggattgtctacttcagcaatagcaactaatggat 1200 
A.TVVTTITGLSTSAIATNGF 

ttgtaagaggaggaggagcatattatttaatatctagaagtctagggccagaatttggtg 1260 
VRGGGAYYLISRSLGPEFGG 

gtgcaattggtctaatcttcgcctttgccaacgctgttgcagttgctatgtatgtggttg 1320 
AIGLIFAFANAVAVAMYVVG 

gatttgcagaaaccgtggtggagttgcttaaggaacattccatacttatgatagatgaaa 1380 
FAETVVELLKEHSILMIDEX 
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tcaatgatatccgaattattggagccattacagtcgtgattcttttaggtatctcagtag 1440 
NDIRIIGAITVVILLGISVA 

ctggaatggagtgggaagcaaaagctcagattgttcttttggtgatcctacttcttgcta 1500 
GMEWEAKAQIVIiLVILLLAI 

ttggtgatttcgtcataggaacatttatcccactggagagcaagaagccaaaagggtttt 1560 
GDPVIGTP I P I* E SK'KPKGFF 

ttggttataaatctgaaatatttaatgagaactttgggcccgattttcgagaggaagaga 1620 
G Y K SEXFNENFGPnFR BEET 

ctttcttttctgtatttgccatcttttttcctgctgcaactggtattctggctggagcaa 1680 
FFSVFAIFFPAATG ILAGAN 

atatctcaggtgatcttgcagatcctcagtcagccatacccaaaggaacactcctagcca 1740 
I SGDLADPQSAX PKGTIiLAI 

ttttaattactacattggtttacgtaggaattgcagtatctgtaggttcttgtgttgttc 1800 
LITTLVYVGIAVSVGSCVVR 

gagatgccactggaaacgttaatgacactatcgtaacagagctaacaaactgtacttctg 1860 
DATGNVNDTIVTELTNCTSA 

cagcctgcaaattaaactttgatttttcatcttgtgaaagcagtccttgttcctatggcc 1920 
ACKLNFDFSSCESSPCSYGLi 

taatgaacaacttccaggtaatgagtatggtgtcaggatttacaccactaatttctgcag 1980 
MNNFQVMSMVSGFTPLISAG 

gtatattttcagccactctttcttcagcattagcatccctagtgagtgctcccaaaatat 2040 
IFSATLSSALASLVSAPKIF 

ttcaggctctatgtaaggacaacatctacccagctttccagatgtttgctaaaggttatg 2100 
QALCKDNIYPAFQMFAKGY'G 

ggaaaaataatgaacctcttcgtggctacatcttaacattcttaattgcacttggattca 2160 
KN NEPLRGYIIiTFL XALGFX 

tcttaattgctgaactgaatgttattgcaccaattatctcaaacttcttccttgcatcat 2220 
I* IAELNVIA P I I SNFFLASY 

atgcattgatcaatttttcagtattccatgcatcacttgcaaaatctccaggatggcgtc 2280 
ALINFSVFHASLAKSPGWRP 

ctgcattcaaatactacaacatgtggatatcacttcttggagcaattctttgttgcatag 2340 
AFKYYNMWISLLGAILCCIV 

taatgttcgtcattaactggtgggctgcattgctaacatatgtgatagtccttgggctgt 2400 
MFVINWWAALLTYVIVLGLY 

atatttatgttacctacaaaaaaccagatgtgaattggggatcctctacacaagccctga 2460 
I YVTYKKPDVNWGS STQALT 

cttacc tgaatgcac tgcagcattcaat tcgtc t ttctggagtggaagaccacgtgaaaa 2520 
YLNADQHS IRLSGVEDHVKN 

actttaggccacagtgtcttgttatgacaggtgctccaaactcacgtccagctttacttc 2580 
FRPQCLVMTGAPNSRPALLH 

atcttgttcatgatttcacaaaaaatgttggtttgatgatctgtggccatgtacatatgg 2640 
ltVHT>FTKNV6XtMXCGBVHMQ 

gtcctcgaagacaagccatgaaagagatgtccatcgatcaagccaaatatcagcgatggc 2700 
PjRJ?QAMKEMS I DQAKYQRWL 

ttattaagaacaaaatgaaggcattttatgctccagtacatgcagatgacttgagagaag 2760 
I KNKMKAFYAPVHADDLREG 

gtgcacagtatttgatgcaggctgctggtcttggtcgtatgaagccaaacacacttgtcc 2820 
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AQYLMQAAGLGRMKPNTLVL 

TrrrrnTrriTTT?WTTE 2880 

tatttcatgatgcttttg^ 29 AO 

gc f c r g rT s fT a l 9 T 9 i aa Y at i 9t rT g s c T" ta D at l ct s CC K 3060 

aaccactcagtgaaaaacca^^ 3120 
ctcaaccactgttgaaa^ 3180 
aaaagcttcttgaa^^ 

tctggtggctttttgatgatggaggtttgaccttattgataccttaccttctgacgacca 3300 
WWLFDDGGI-TIiLIPYIjIj.1 

agaaaaaatggaaagactgtaagatcagagtatecattggtggaaagataaacagaatag 3360 
K KWKDCK I R V F I GGK INK**' 

accatgaccggagagcgatggctactttgcttagcaagttccggatagacttttctgata 3420 
HDRRAMATLL SKFK1U-" 

t T g rT a i ga r t f c r t T c T a r a r g rrr t f t f a r" 

aaa it ca tt gagcca,acagac t tca t gaaga t ga t aaagagcaa r 
tg aaag E aaga T accatggcgaa t aacag^ 
cataccggcagatcaggttaaatgag^ 
ttgtcatgagt^ 

ggttagaagctctatctaaggacctaccaccaatcctcctagtt^ 3780 

gtgtccttaccttctattcataaatgttctatacagtggacagccctccagaatggtact 3840 

V L T F Y s * 
tcagtgcctagtgtagtaacctgaaatcttcaatgacacattaacatcacaatggcgaat 3900 

ggtgacttttctttcacgatttcattaatttgaaagcacacaggaaagcttgctccattg 

ataacgtgtatggagacttcggttttagtcaattccatatctcaatcttaatggtgattc 

ttctctgttgaactgaagtttgtgagagtagttttcctttgctacttgaatagcaataaa 

agcgtgttaactttttgg 
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Figure 2 
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Figure 4 
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Figure SB 
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Figure 7 
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